(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



"" Worul 'TZEZZZ. 0 "**" iniiniiiiiiiiiNi&iifniinHRi 

(43) International Publication Date (10) International Publication Number 

18 December 2003 (18.12.2003) PCT WO 03/104438 A2 



(51) International Patent Classification 7 : C12N 

(21) International Application Number: PCT/US03/ 1 974 1 

(22) International Filing Date: 10 June 2003 (10.06.2003) 

(25) Filing Language: English 

(26) Publication Language: English 



(30) Priority Data: 

60/388,075 



1 1 June 2002 ( 1 1 .06.2002 ) US 



(71) Applicant (for all designated States except US): AVALON 
PHARMACEUTICALS, INC. [US/US]; 20358 Seneca 
Meadows Parkway, Germantown, MD 20876 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): YOUNG, Paul, E. 

[US/US]; 207 Beckwith Street, Gaithersburg, MD 20870 
(US). EBNER, Reinhard [US/US]; 9906 Shelbume Ter- 
race #316, Gaithersburg, MD 20878 (US). 

(74) Agents: GRANT, Alan, J. et aL; Carella, Byrne, Bain, 
Gilfillan, Cecchi, Stewart & Ol stein, 6 Becker Farm Road, 
Roselan, NJ 07068 (US). 



(81) Designated States (national): AE, AG. AL. AM. AT. AU, 

AZ, BA, BB, BG, BR, BY, BZ, CA, CM, CN. CO, CR, CU. 
CZ, DE, DK, DM, DZ. EC, EE, ES. FL GB. GD. GE. Gil, 
GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP. KR, KZ. LC. 
LK, LR, LS, LT, LU, LV. MA, MD, MG. MK. MN. MW. 
MX, MZ, NO, NZ, OM, PH. PL, PT, RO. RU, SC, SD, SE, 
SG, SK, SL, TJ, TM, TN, TR, TT, TZ. UA. UG, US, UZ, 
VC, VN, YU, ZA, ZM, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW), 
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM >, 
European patent (AT, BE, BG, CH, CY, CZ, DE, DK, EE, 
ES, FI, FR, GB, GR, HU, IE, IT, LU, MC, NL, PT, RO, 
SE, SI, SK, TR), OAPI patent (BF, BJ, CF, CG, CI, CM, 
GA, GN, GQ, GW, ML, MR, NE, SN, TD, TG). 

Published: 

— without international search report and to be republished 
upon receipt of that report 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



< 
00 

m 



(54) Title: CANCER-LINKED GENE AS TARGET FOR CHEMOTHERAPY 

(57) Abstract: Cancer-linked gene sequences, and derived amino acid sequences, are disclosed along with processes for assaying 
potential antitumor agents based on their modulation of the expression of these cancer-linked genes. Also disclosed are antibodies 
that react with the disclosed polypeptides and methods of using the antibodies to treat cancerous conditions, such as by using the 
antibody to target cancerous cells in vivo for purposes of delivering therapeutic agents thereto. Also described are methods of diag- 
nosing using the gene sequences. 



THIS PAGE BLANK fuspre) 



WO 03/104438 



10/518039 

DT05 IW PCT/PTO 0 ? DEC 20Q4i 

PCT/US03/19741 



10 



15 



CANCER-LINKED GENE AS TARGET FOR 
CHEMOTHERAPY 



This application claims priority of U.S. Provisional Patent Application 
60/388,075, filed 11 June 2002, the disclosure of which is hereby incorporated 
by reference in its entirety. 



FIELD OF THE INVENTION 



The present invention relates to methods of screening cancer-linked 
genes and expression products for involvement in the cancer initiation and 
facilitation process as a means of cancer diagnosis as well as the use of such 
genes for screening potential anti-cancer agents, including small organic 
20 compounds and other molecules, and development of therapeutic agents. 



BACKGROUND OF THE INVENTION 

25 

Cancer-linked genes are valuable in that they indicate genetic 
differences between cancer cells and normal cells, such as where a gene is 
expressed in a cancer cell but not in a non-cancer cell, or where said gene is 
over-expressed or expressed at a higher level in a cancer as opposed to 

30 normal or non-cancer cell. In addition, the expression of such a gene in a 
normal cell but not in a cancer cell, especially of the same type of tissue, can 
indicate important functions in the cancerous process. For example, screening 
assays for novel drugs are based on the response of model cell based 
systems in vitro to treatment with specific compounds. Such genes are also 

35 useful in the diagnosis of cancer and the identification of a cell as cancerous. 
Gene activity is readily measured by measuring the rate of production of gene 
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products, such as RNAs and polypeptides encoded by such genes. Where 
genes encode cell surface proteins, appearance of, or alterations in, such 
proteins, as cell surface markers, are an indication of neoplastic activity. 
Some such screens rely on specific genes, such as oncogenes (or gene 
5 mutations). In accordance with the present invention, a cancer-linked gene 
has been identified and its putative amino acid sequence worked out. Such 
gene is useful in the diagnosing of cancer, the screening of anticancer agents 
and the treatment of cancer using such agents, especially in that these genes 
encode polypeptides that can act as markers, such as cell surface markers, 
10 thereby providing ready targets for anti-tumor agents such as antibodies, 
preferably antibodies complexed to cytotoxic agents, including apoptotic 
agents. 



15 

BRIEF SUMMARY OF THE INVENTION 

In accordance with the present invention, there is provided herein a 
cancer specific gene, linked especially to breast cancer, or otherwise 
20 involved in the cancer initiating and facilitating process and the derived amino 
acid sequence thereof, including a number of different transcripts derived from 
said gene. 

In one aspect, the present invention relates to a process for identifying 
25 an agent that modulates the activity of a cancer-related gene comprising: 

(a) contacting a compound with a cell containing a gene that 
corresponds to a polynucleotide having a sequence selected from the group 
consisting of SEQ ID NO: 1 , 2, 3 and 4 and under conditions promoting the 
expression of said gene; and 
30 (b) detecting a difference in expression of said gene relative to when 

said compound is not present 
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thereby identifying an agent that modulates the activity of a cancer- 
related gene. 

In various embodiments of such a process, the cell is a cancer cell and 
5 the difference in expression is a decrease in expression. Such 
polynucleotides may also include those that have sequences identical to SEQ 
ID NO: 1, 2, 3 and 4. 

In another aspect, the present invention relates to a process for 
10 identifying an antineoplastic agent comprising contacting a cell exhibiting 
neoplastic activity with a compound first identified as a cancer related gene 
modulator using an assay process disclosed herein and detecting a decrease 
in said neoplastic activity after said contacting compared to when said 
contacting does not occur. Such neoplastic activity may include accelerated 
1 5 cellular replication and/or metastasis, and the decrease in neoplastic activity 
preferably results from the death of the cell, or senescence, terminal 
differentiation or growth inhibition. 

The present invention also relates to a process for identifying an anti- 
20 neoplastic agent comprising administering to an animal exhibiting a cancer 
condition an effective amount of an agent first identified according to a 
process of one of one of the assays disclosed according to the invention and 
detecting a decrease in said cancerous condition. 

25 The present invention further relates to a process for determining the 

cancerous status of a cell, comprising determining an increase in the level of 
expression in said cell of at least one gene that corresponds to a 
polynucleotide having a sequence selected from the group consisting of SEQ 
ID NO: 1, 2, 3 and 4 wherein an elevated expression relative to a known non- 
30 cancerous cell indicates a cancerous state or potentially cancerous state. 
Such elevated expression may be due to an increased copy number. 
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The present invention additionally relates to an isolated polypeptide, 
encoded by one of the polynucleotide transcripts disclosed herein, comprising 
an amino acid sequence homologous to an amino acid sequence selected 
from the group consisting of SEQ ID NO: 5 and 6 wherein any difference 
5 between said amino acid sequence and the sequence of SEQ ID NO: 5 and 6 
is due solely to conservative amino acid substitutions and wherein said 
isolated polypeptide comprises at least one immunogenic fragment. In a 
preferred embodiment, the present invention encompasses an isolated 
polypeptide comprising an amino acid sequence homologous to an amino 
1 0 acid sequence selected from the group consisting of SEQ ID NO: 5 and 6. 

The present invention also relates to an antibody that reacts with a 
polypeptide as disclosed herein, preferably a polypeptide comprising an 
amino acid sequence selected from the group consisting of SEQ ID NO: 5 and 
1 5 6. Such an antibody may be polyclonal, monoclonal, recombinant or synthetic 
in origin. 

In one such embodiment, said antibody is associated, either 
covalently or non-covalently, with a cytotoxic agent, for example, an 
20 apoptotic agent. Thus, the present invention relates to an 
immunoconjugate comprising an antibody of the invention and a cytotoxic 
agent. 

The present invention also relates to a process for treating cancer 
25 comprising contacting a cancerous cell with an agent having activity against 
an expression product encoded by a gene sequence selected from the group 
consisting of SEQ ID NO: 1, 2, 3 and 4. In one such embodiment, the 
cancerous cell is contacted in vivo. In another such embodiment, said agent 
has affinity for said expression product. In a preferred embodiment, such 
30 agent is an antibody disclosed herein, such as an antibody that is specific or 
selective for, or otherwise reacts with, a polypeptide of the Invention. In a 
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preferred embodiment, the expression product is a polypeptide incorporating 
an amino acid sequence selected from SEQ ID NO: 5 and 6. 

The present invention further encompasses an immunogenic 
5 composition comprising a polypeptide disclosed herein, as well as 
compositions formed using antibodies specific for these polypeptides. 

The present invention is also directed to uses of such compositions. 
Such uses include a method for treating cancer in an animal afflicted 

10 therewith comprising administering to said animal an amount of an 
immunogenic composition of one or more of the polypeptides disclosed herein 
where such amount is an amount sufficient to elicit the production of cytotoxic 
T lymphocytes specific for a polypeptide of the invention, preferably a 
polypeptide incorporating a sequence of SEQ ID NO: 5 and 6. In a preferred 

1 5 embodiment, the animal to be so treated is a human patient. 



DEFINITIONS 

20 As used herein, the terms "portion," "segment," and "fragment," when 

used in relation to polypeptides, refer to a continuous sequence of residues, 
such as amino acid residues, which sequence forms a subset of a larger 
sequence. For example, if a polypeptide were subjected to treatment with any of 
the common endopeptidases, such as trypsin or chymotrypsin, the oligopeptides 

25 resulting from such treatment would represent portions, segments or fragments 
of the starting polypeptide. When used in relation to a polynucleotides, such 
terms refer to the products produced by treatment of said polynucleotides with 
any of the common endonucleases. 

30 As used herein, the term "isolated" means that the material is removed 

from its original environment (e.g., the natural environment if it is naturally 
occurring). It could also be produced recombinantly and subsequently purified. 
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For example, a naturally-occurring polynucleotide or polypeptide present in a 
living animal is not isolated, but the same polynucleotide or polypeptide, 
separated from some or all of the coexisting materials in the natural system, is 
isolated. Such polynucleotides, for example, those prepared recombinantly, 
5 could be part of a vector and/or such polynucleotides or polypeptides could be 
part of a composition, and still be isolated in that such vector or composition is 
not part of its natural environment. In one embodiment of the present invention, 
such isolated, or purified, polypeptide is useful in generating antibodies for 
practicing the invention, or where said antibody is attached to a cytotoxic or 
1 0 cytolytic agent, such as an apoptotic agent. 

The term "percent identity" or "percent identical," when referring to a 
sequence, means that a sequence is compared to a claimed or described 
sequence after alignment of the sequence to be compared (the "Compared 
1 5 Sequence") with the described or claimed sequence (the "Reference 
Sequence"). The Percent Identity is then determined according to the following 
formula: 

Percent Identity = 100 [1-(C/R)] 

20 

wherein C is the number of differences between the Reference Sequence and 
the Compared Sequence over the length of alignment between the Reference 
Sequence and the Compared Sequence wherein (i) each base or amino acid in 
the Reference Sequence that does not have a corresponding aligned base or 

25 amino acid in the Compared Sequence and (ii) each gap in the Reference 
Sequence and (iii) each aligned base or amino acid in the Reference Sequence 
that is different from an aligned base or amino acid in the Compared Sequence, 
constitutes a difference; and R is the number of bases or amino acids in the 
Reference Sequence over the length of the alignment with the Compared 

30 Sequence with any gap created in the Reference Sequence also being counted 
as a base or amino acid. 
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If an alignment exists between the Compared Sequence and the 
Reference Sequence for which the percent identity as calculated above is about 
equal to or greater than a specified minimum Percent Identity then the 
Compared Sequence has the specified minimum percent identity to the 
5 Reference Sequence even though alignments may exist in which the 
hereinabove calculated Percent Identity is less than the specified Percent 
Identity. 

As known in the art "similarity" between two polypeptides is determined 
1 0 by comparing the amino acid sequence and its conserved amino acid 
substitutes of one polypeptide to the sequence of a second polypeptide. 

In accordance with the present invention, the term "DNA segment" or 
"DNA sequence" refers to a DNA polymer, in the form of a separate fragment 

15 or as a component of a larger DNA construct, which has been derived from 
DNA isolated at least once in substantially pure form, i.e., free, of 
contaminating endogenous materials and in a quantity or concentration 
enabling identification, manipulation, and recovery of the segment and its 
component nucleotide sequences by standard biochemical methods, for 

20 example, using a cloning vector. Such segments are provided in the form of 
an open reading frame uninterrupted by internal nontranslated sequences, or 
introns, which are typically present in eukaryotic genes. Sequences of non- 
translated DNA may be present downstream from the open reading frame, 
where the same do not interfere with manipulation or expression of the coding 

25 regions. 

The term "coding region" refers to that portion of a gene which either 
naturally or normally codes for the expression product of that gene in its 
natural genomic environment, i.e., the region coding in vivo for the native 
30 expression product of the gene. The coding region can be from a normal, 
mutated or altered gene, or can even be from a DNA sequence, or gene, 
wholly synthesized in the laboratory using methods well known to those of 
skill in the art of DNA synthesis. 
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In accordance with the present invention, the term "nucleotide 
sequence" refers to a heteropolymer of deoxyribonucleotides. Generally, DNA 
segments encoding the proteins provided by this invention are assembled 
5 from cDNA fragments and short oligonucleotide linkers, or from a series of 
oligonucleotides, to provide a synthetic gene which is capable of being 
expressed in a recombinant transcriptional unit comprising regulatory 
elements derived from a microbial, eukaryotic or viral operon. 

1 0 The term "expression product" means that polypeptide or protein that is 

the natural translation product of the gene and any nucleic acid sequence 
coding equivalents resulting from genetic code degeneracy and thus coding 
for the same amino acid(s). 

1 5 The term "active fragment," when referring to a coding sequence, means 

a portion comprising less than the complete coding region whose expression 
product retains essentially the same biological function or activity as the 
expression product of the complete coding region. 

The term "primer" means a short nucleic acid sequence that is paired 

20 with one strand of DNA and provides a free 3'-OH end at which a DNA 
polymerase starts synthesis of a deoxyribonucleotide chain. 

The term "promoter" means a region of DNA involved in binding of RNA 
polymerase to initiate transcription. The term "enhancer" refers to a region of 
25 DNA that, when present and active, has the effect of increasing expression of 
a different DNA sequence that is being expressed, thereby increasing the 
amount of expression product formed from said different DNA sequence. 

The term "open reading frame (ORF)" means a series of triplets coding 
30 for amino acids without any termination codons and is a sequence 
(potentially) translatable into protein. 
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As used herein, reference to a "DNA sequence" includes both single 
stranded and double stranded DNA. Thus, the specific sequence, unless the 
context indicates otherwise, refers to the single strand DNA of such 
sequence, the duplex of such sequence with its complement (double stranded 
5 DNA) and the complement of such sequence. 

As used herein, "corresponding genes" refers to genes that encode an 
RNA that is at least 90% identical, preferably at least 95% identical, most 
preferably at least 98% identical, and especially identical, to an RNA encoded 

10 by one of the nucleotide sequences disclosed herein (i.e., SEQ ID NO: 1, 2, 3 
and 4). Such genes will also encode the same polypeptide sequence as any 
of the sequences disclosed herein, preferably SEQ ID NO: 1, 2, 3 and 4, but 
may include differences in such amino acid sequences where such 
differences are limited to conservative amino acid substitutions, such as 

15 where the same overall three dimensional structure, and thus the same 
antigenic character, is maintained. Thus, amino acid sequences may be within 
the scope of the present invention where they react with the same antibodies 
that react with polypeptides comprising the sequences of SEQ ID NO: 5 and 
6. A "corresponding gene" includes splice variants thereof. 

20 

The genes identified by the present disclosure are considered "cancer- 
related" genes, as this term is used herein, and include genes expressed at 
higher levels (due, for example, to elevated rates of expression, elevated 
extent of expression or increased copy number) in cancer cells relative to 

25 expression of these genes in normal (i.e., non-cancerous) cells where said 
cancerous state or status of test cells or tissues has been determined by 
methods known in the art, such as by reverse transcriptase polymerase chain 
reaction (RT-PCR) as described in the Examples herein. In specific 
embodiments, this relates to the genes whose sequences correspond to the 

30 sequences of SEQ ID NO: 1, 2, 3 and 4. 
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As used herein, the term "conservative amino acid substitutions" are 
defined herein as exchanges within one of the following five groups: 
I. Small aliphatic, nonpolar or slightly polar residues: 
Ala, Ser, Thr, Pro, Gly; 
5 II. Polar, negatively charged residues and their amides: 

Asp, Asn, Glu, Gin; 

III. Polar, positively charged residues: 

His, Arg, Lys; 

IV. Large, aliphatic, nonpolar residues: 
1 0 Met Leu, He, Val, Cys 

V. Large, aromatic residues: 

Phe, Tyr, Trp 



15 

DETAILED SUMMARY OF THE INVENTION 

The present invention relates to processes for utilizing a nucleotide 
20 sequence for a cancer-linked gene, polypeptides encoded by such sequences 
and antibodies reactive with such polypeptides in methods of treating and 
diagnosing cancer, preferably breast cancer, and in carrying out screening 
assays for agents effective in reducing the activity of cancer-linked genes and 
thereby treating a cancerous condition. 

25 

The sequences disclosed herein incorporate various polynucleotide 
transcripts (SEQ ID NO: 1, 2, 3 and 4) and the derived amino acid sequence 
(SEQ ID NO: 5 and 6) from said transcripts are available as targets for 
chemotherapeutic agents, especially anti-cancer agents, including antibodies 
30 specific for said polypeptides. The transcript for this gene is up in breast 
cancer and should encode a type I transmembrane protein with a large 
immunogenic extracellular domain. The gene disclosed herein is especially 



10 



WO 03/104438 




PCT/US03/19741 



characteristic of breast cancer, preferably Wilms tumor and soft tissue 
fibromatosis. 



The cancer-related polynucleotide sequences disclosed herein 
5 correspond to gene sequences whose expression is indicative of the 
cancerous status of a given cell. Such sequences are substantially identical to 
SEQ ID NO: 1, 2, 3 and 4, which represent different transcripts identified from 
the GenBank EST database and which exhibit cancer-specific expression. 
The polynucleotides of the invention are those that correspond to a sequence 
10 of SEQ ID NO: 1, 2, 3 and 4. Such sequences have been searched within the 
GenBank database, especially the EST database, with the following results: 



15 



20 



25 



Type: 
Tissue: 

Aff yFragment-ID(s) : 

Accession(s): 
Unigene cluster-ID(s): 
Chromosomal location: 



cell-surface tumor antigen 
therapeutic antibody target 

breast cancer, 

soft tissue fibromatosis, Wilms tumor 

124040, 143095, 156021 
KIAA1497 protein 

AI885003, AI857396, AI015776 

Hs. 126085 



The nucleotides and polypeptides, as gene products, used in the 
30 processes of the present invention may comprise a recombinant polynucleotide 
or polypeptide, a natural polynucleotide or polypeptide, or a synthetic 
polynucleotide or polypeptide, or a recombinant polynucleotide or polypeptide. 

Fragments of such polynucleotides and polypeptides as are disclosed 
35 herein may also be useful in practicing the processes of the present invention. 
For example, a fragment, derivative or analog of the polypeptide (SEQ ID NO: 5 
and 6) may be (i) one in which one or more of the amino acid residues are 
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substituted with a conserved or non-conserved amino acid residue (preferably a 
conserved amino acid residue) and such substituted amino acid residue may or 
may not be one encoded by the genetic code, or (ii) one in which one or more of 
the amino acid residues includes a substituent group, or (iii) one in which the 
5 mature polypeptide is fused with another compound, such as a compound to 
increase the half-life of the polypeptide (for example, polyethylene glycol), or (iv) 
one in which the additional amino acids are fused to the mature polypeptide, 
such as a leader or secretory sequence or a sequence which is employed for 
purification of the mature polypeptide (such as a histidine hexapeptide) or a 
1 0 proprotein sequence. Such fragments, derivatives and analogs are deemed to 
be within the scope of those skilled in the art from the teachings herein. 

In another aspect, the present invention relates to an isolated 
polypeptide, including a purified polypeptide, comprising an amino acid 

1 5 sequence at least 90% identical to the amino acid sequence of SEQ ID NO: 5 
and/or 6. In preferred embodiments, said isolated polypeptide comprises an 
amino acid sequence having sequence identity of at least 95%, preferably at 
least about 98%, and especially is identical to, the sequence of SEQ ID NO: 5 
and/or 6. The present invention also includes isolated active fragments of such 

20 polypeptides where said fragments retain the biological activity of the 
polypeptide or where such active fragments are useful as specific targets for 
cancer treatment, prevention or diagnosis. Thus, the present invention relates to 
any polypeptides, or fragments thereof, with sufficient sequence homology to 
the sequences disclosed herein as to be useful in the production of antibodies 

25 that react with (i.e., are selective or specific for) the polypeptides of SEQ ID NO: 
5 and 6 so as to be useful in targeting cells that exhibit such polypeptides, or 
fragments, on their surfaces, thereby providing targets for such antibodies and 
therapeutic agents associated with such antibodies. 

30 The polynucleotides and polypeptides useful in practicing the processes 

of the present invention may likewise be obtained in an isolated or purified form. 
In addition, the polypeptide disclosed herein as being useful in practicing the 
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processes of the invention are believed to be surface proteins present on cells, 
such as cancerous cells. Precisely how such cancer-linked proteins are used in 
the processes of the invention may thus differ depending on the therapeutic 
approach used. For example, cell-surface proteins, such as receptors, are 
5 desirable targets for cytotoxic antibodies that can be generated against the 
polypeptides disclosed herein. 

The sequence information disclosed herein, as derived from the 
GenBank submissions, can readily be utilized by those skilled in the art to 
10 prepare the corresponding full-length polypeptide by peptide synthesis. The 
same is true for either the polynucleotides or polypeptides disclosed herein for 
use in the methods of the invention. 

The present invention relates to an isolated polypeptide, encoded by 
1 5 one of the polynucleotide transcripts disclosed herein, comprising an amino 
acid sequence homologous to an amino acid sequence selected from the 
group consisting of SEQ ID NO: 5 and 6, wherein any difference between 
amino acid sequence in the isolated polypeptide and the sequence of SEQ ID 
NO: 5 and 6 is due solely to conservative amino acid substitutions and 
20 wherein said isolated polypeptide comprises at least one immunogenic 
fragment. In a preferred embodiment, the present invention encompasses an 
isolated polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO: 5 and 6. 

25 Methods of producing recombinant cells and vectors useful in 

preparing the polynucleotides and polypeptides disclosed herein are well 
known to those skilled in the molecular biology art. See, for example, 
Sambrook, et al., Molecular Cloning: A Laboratory Manual, Second Edition, Cold 
Spring Harbor, N.Y., (1989), Wu et al., Methods in Gene Biotechnology (CRC 

30 Press, New York, NY, 1997), and Recombinant Gene Expression Protocols, 
in Methods in Molecular Biology, Vol. 62, (Tuan, ed., Humana Press, Totowa, 
NJ, 1997), the disclosures of which are hereby incorporated by reference. 
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In one aspect, the present invention relates to a process for identifying 
an agent that modulates the activity of a cancer-related gene comprising: 

(a) contacting a compound with a cell containing a gene that 
corresponds to a polynucleotide having a sequence selected from the group 

5 consisting of SEQ ID NO: 1, 2, 3 and 4 and under conditions promoting the 
expression of said gene; and 

(b) detecting a difference in expression of said gene relative to when 
said compound is not present 

thereby identifying an agent that modulates the activity of a cancer- 
1 0 related gene. 

In specific embodiments of such process the cell is a cancer cell and 
the difference in expression is a decrease in expression. Such 
polynucleotides may also include those that have sequences identical to SEQ 
15 ID NO: 1,2, 3 and 4. 

In another aspect, the present invention relates to a process for 
identifying an anti-neoplastic agent comprising contacting a cell exhibiting 
neoplastic activity with a compound first identified as a cancer related gene 
20 modulator using an assay process disclosed herein and detecting a decrease 
in said neoplastic activity after said contacting compared to when said 
contacting does not occur. Such neoplastic activity may include accelerated 
cellular replication and/or metastasis, and the decrease in neoplastic activity 
preferably results from the death of the cell. 

25 

The present invention also relates to a process for identifying an anti- 
neoplastic agent comprising administering to an animal exhibiting a cancer 
condition an effective amount of an agent first identified according to a 
process of one of one of the assays disclosed according to the invention and 
30 detecting a decrease in said cancerous condition. 
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In specific embodiments of the present invention, the genes useful for 
the invention comprise genes that correspond to polynucleotides having a 
sequence selected from SEQ ID NO: 1,2, 3 and 4, or may comprise the 
sequence of any of the polynucleotides disclosed herein (where the latter are 
5 cDNA sequences). 

In accordance with the present invention, such assays rely on methods 
of determining the activity of the gene in question. Such assays are 
advantageously based on model cellular systems using cancer cell lines, 

10 primary cancer cells, or cancerous tissue samples that are maintained in 
growth medium and treated with compounds at a single concentration or at a 
range of concentrations. At specific times after treatment, cellular RNAs are 
conveniently isolated from the treated cells or tissues, which RNAs are 
indicative of expression of selected genes. The cellular RNA is then divided 

1 5 and subjected to differential analysis that detects the presence and/or quantity 
of specific RNA transcripts, which transcripts may then be amplified for 
detection purposes using standard methodologies, such as, for example, 
reverse transcriptase polymerase chain reaction (RT-PCR), etc. The presence 
or absence, or concentration levels, of specific RNA transcripts are 

20 determined from these measurements. The polynucleotide sequences 
disclosed herein are readily used as probes for the detection of such RNA 
transcripts and thus the measurement of gene activity and expression. 

The polynucleotides of the invention can include fully operational genes 
25 with attendant control or regulatory sequences or merely a polynucleotide 
sequence encoding the corresponding polypeptide or an active fragment or 
analog thereof. 

Because expression of the polynucleotide sequences disclosed herein 
30 are specific to the cancerous state, useful gene modulation is downward 
modulation, so that, as a result of exposure to an antineoplastic agent 
identified by the screening assays herein, the corresponding gene of the 
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cancerous cell is expressed at a lower level (or not expressed at all) when 
exposed to the agent as compared to the expression when not exposed to the 
agent. For example, the gene sequences disclosed herein (SEQ ID NO: 1, 2, 
3 and 4) correspond to a gene expressed at a higher level in cells of breast 
5 cancer than in normal breast cells. Thus, where said chemical agent causes 
this gene of the tested cell to be expressed at a lower level than the same 
genes of the reference, this is indicative of downward modulation and 
indicates that the chemical agent to be tested has anti-neoplastic activity. 

1 o In carrying out the assays disclosed herein, relative antineoplastic activity 

may be ascertained by the extent to which a given chemical agent modulates 
the expression of genes present in a cancerous cell. Thus, a first chemical agent 
that modulates the expression of a gene associated with the cancerous state 
(i.e., a gene corresponding to one or more of the polynucleotide transcripts 

1 5 disclosed herein) to a larger degree than a second chemical agent tested by the 
assays of the invention is thereby deemed to have higher, or more desirable, or 
more advantageous, anti-neoplastic activity than said second chemical agent. 

The gene expression to be measured is commonly assayed using RNA 
20 expression as an indicator. Thus, the greater the level of RNA (for example, 
messenger RNA or mRNA) detected the higher the level of expression of the 
corresponding gene. Thus, gene expression, either absolute or relative, is 
determined by the relative expression of the RNAs encoded by such genes. 

25 RNA may be isolated from samples in a variety of ways, including lysis 

and denaturation with a phenolic solution containing a chaotropic agent (e.g., 
trizol) followed by isopropanol precipitation, ethanol wash, and resuspension in 
aqueous solution; or lysis and denaturation followed by isolation on solid 
support, such as a Qiagen resin and reconstitution in aqueous solution; or lysis 

30 and denaturation in non-phenolic, aqueous solutions followed by enzymatic 
conversion of RNA to DNA template copies. 
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Normally, prior to applying the processes of the invention, steady state 
RNA expression levels for the genes, and sets of genes, disclosed herein will 
have been obtained. It is the steady state level of such expression that is 
affected by potential anti-neoplastic agents as determined herein. Such steady 
5 state levels of expression are easily determined by any methods that are 
sensitive, specific and accurate. Such methods include, but are in no way limited 
to, real time quantitative polymerase chain reaction (PCR), for example, using a 
Perkin-Elmer 7700 sequence detection system with gene specific primer probe 
combinations as designed using any of several commercially available software 
10 packages, such as Primer Express software., solid support based hybridization 
array technology using appropriate internal controls for quantitation, including 
filter, bead, or microchip based arrays, solid support based hybridization arrays 
using, for example, chemiiuminescent, fluorescent, or electrochemical reaction 
based detection systems. 

15 

The gene expression indicative of a cancerous state need not be 
characteristic of every cell of a given tissue. Thus, the methods disclosed 
herein are useful for detecting the presence of a cancerous condition within a 
tissue where less than all cells exhibit the complete pattern. Thus, for 

20 example, a selected gene corresponding to the sequence of SEQ ID NO: 1, 
may be found, using appropriate probes, either DNA or RNA, to be present in 
as little as 60% of cells derived from a sample of tumorous, or malignant, 
tissue. In a highly preferred embodiment, such gene pattern is found to be 
present in at least 100% of cells drawn from a cancerous tissue and absent 

25 from at least 100% of a corresponding normal, non-cancerous, tissue sample. 

Expression of a gene may be related to copy number, and changes in 
expression may be measured by determining copy number. Such change in 
gene copy number may be determined by determining a change in expression 
30 of messenger RNA encoded by a particular gene sequence, especially that of 
SEQ ID NO: 1, 2, 3 and 4. Also in accordance with the present invention, said 
gene may be a cancer initiating or facilitating gene. In carrying out the 
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methods of the present invention, a cancer facilitating gene is a gene that, 
while not directly initiating tumor formation or growth, acts, such as through 
the actions of its expression product, to direct, enhance, or otherwise facilitate 
the progress of the cancerous condition, including where such gene acts 
5 against genes, or gene expression products, that would otherwise have the 
effect of decreasing tumor formation and/or growth. 

Although the expression of a gene corresponding to a sequence of 
SEQ ID NO: 1, 2, 3 and 4 may be indicative of a cancerous status for a given 

1 0 cell, the mere presence of such a gene may not alone be sufficient to achieve 
a malignant condition and thus the level of expression of such gene may also 
be a significant factor in determining the attainment of a cancerous state. 
Thus, it becomes essential to also determine the level of expression of a gene 
as disclosed herein, including substantially similar sequences, as a separate 

1 5 means of diagnosing the presence of a cancerous status for a given cell, 
groups of cells, or tissues, either in culture or in situ. 

The level of expression of the polypeptides disclosed herein is also a 
measure of gene expression, such as polypeptides having sequence identical, 
20 or similar to, any polypeptide encoded by a sequence of SEQ ID NO: 1 , 2, 3 
and 4, especially a polypeptide whose amino acid sequence is the sequence 
of SEQ ID NO: 5 and 6. 

In accordance with the foregoing, the present invention specifically 
25 contemplates a method for determining the cancerous status of a cell to be 
tested, comprising determining the level of expression in said cell of a gene 
that includes one of the nucleotide sequences selected from the sequences of 
SEQ ID NO: 1, 2, 3 and 4, including sequences substantially identical to said 
sequences, or characteristic fragments thereof, or the complements of any of 
30 the foregoing and then comparing said expression to that of a cell known to 
be non-cancerous whereby the difference in said expression indicates that 
said cell to be tested is cancerous. 
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In accordance with the invention, although gene expression for a gene 
that includes as a portion thereof one of the sequences of SEQ ID NO: 1,2,3 
and 4, is preferably determined by use of a probe that is a fragment of such 
5 nucleotide sequence, it is to be understood that the probe may be formed 
from a different portion of the gene. Expression of the gene may be 
determined by use of a nucleotide probe that hybridizes to messenger RNA 
(mRNA) transcribed from a portion of the gene other than the specific 
nucleotide sequence disclosed herein. 

10 

It should be noted that there are a variety of different contexts in which 
genes have been evaluated as being involved in the cancerous process. 
Thus, some genes may be oncogenes and encode proteins that are directly 
involved in the cancerous process and thereby promote the occurrence of 

1 5 cancer in an animal. In addition, other genes may serve to suppress the 
cancerous state in a given cell or cell type and thereby work against a 
cancerous condition forming in an animal. Other genes may simply be 
involved either directly or indirectly in the cancerous process or condition and 
may serve in an ancillary capacity with respect to the cancerous state. All 

20 such types of genes are deemed with those to be determined in accordance 
with the invention as disclosed herein. Thus, the gene determined by said 
process of the invention may be an oncogene, or the gene determined by said 
process may be a cancer facilitating gene, the latter including a gene that 
directly or indirectly affects the cancerous process, either in the promotion of a 

25 cancerous condition or in facilitating the progress of cancerous growth or 
otherwise modulating the growth of cancer cells, either in vivo or ex vivo. In 
addition, the gene determined by said process may be a cancer suppressor 
gene, which gene works either directly or indirectly to suppress the initiation or 
progress of a cancerous condition. Such genes may work indirectly where 

30 their expression alters the activity of some other gene or gene expression 
product that is itself directly involved in initiating or facilitating the progress of 
a cancerous condition. For example, a gene that encodes a polypeptide, 



19 



WO 03/104438 




PCT/US03/19741 



either wild or mutant in type, which polypeptide acts to suppress of tumor 
suppressor gene, or its expression product, will thereby act indirectly to 
promote tumor growth. 

5 As noted previously, polynucleotides encoding the same proteins as 

any of SEQ ID NO: 1, 2, 3 and 4, regardless of the percent identity of such 
sequences, are also specifically contemplated by any of the methods of the 
present invention that rely on any or all of said sequences, regardless of how 
they are otherwise described or limited. Thus, any such sequences are 
1 0 available for use in carrying out any of the methods disclosed according to the 
invention. Such sequences also include any open reading frames, as defined 
herein, present within the sequence of SEQ ID NO: 1 , 2, 3 and 4. 

Because a gene disclosed according to the invention "corresponds to" 

1 5 a polynucleotide having a sequence of SEQ ID NO: 1, 2, 3 and 4, said gene 
encodes an RNA (processed or unprocessed, including naturally occurring 
splice variants and alleles) that is at least 90% identical, preferably at least 
95% identical, most preferably at least 98% identical to, and especially 
identical to, an RNA that would be encoded by, or be complementary to, such 

20 as by hybridization with, a polynucleotide having the indicated sequence. In 
addition, genes including sequences at least 90% identical to a sequence 
selected from SEQ ID NO: 1, 2, 3 and 4, preferably at least about 95% 
identical to such a sequence, more preferably at least about 98% identical to 
such sequence and most preferably comprising such sequence are 

25 specifically contemplated by all of the processes of the present invention. 
Sequences encoding the same proteins as any of these sequences, 
regardless of the percent identity of such sequences, are also specifically 
contemplated by any of the methods of the present invention that rely on any 
or all of said sequences, regardless of how they are otherwise described or 

30 limited. The polynucleotide sequences of the invention also include any open 
reading frames, as defined herein, present within any of the sequences of 
SEQ ID NO: 1,2, 3 and 4. 
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The sequences disclosed herein may be genomic in nature and thus 
represent the sequence of an actual gene, such as a human gene, or may be 
a cDNA sequence derived from a messenger RNA (mRNA) and thus 
5 represent contiguous exonic sequences derived from a corresponding 
genomic sequence, or they may be wholly synthetic in origin for purposes of 
practicing the processes of the invention. Because of the processing that may 
take place in transforming the initial RNA transcript into the final mRNA, the 
sequences disclosed herein may represent less than the full genomic 

1 0 sequence. They may also represent sequences derived from ribosomal and 
transfer RNAs. Consequently, the gene as present in the cell (and 
representing the genomic sequence) and the polynucleotide transcripts 
disclosed herein, including cDNA sequences, may be identical or may be 
such that the cDNAs contain less than the full genomic sequence. Such 

1 5 genes and cDNA sequences are still considered "corresponding sequences" 
(as defined elsewhere herein) because they both encode the same or related 
RNA sequences (i.e., related in the sense of being splice variants or RNAs at 
different stages of processing). Thus, by way of non-limiting example only, a 
gene that encodes an RNA transcript, which is then processed into a shorter 

20 mRNA, is deemed to encode both such RNAs and therefore encodes an RNA 
complementary to (using the usual Watson-Crick complementarity rules), or 
that would otherwise be encoded by, a cDNA (for example, a sequence as 
disclosed herein). Thus, the sequences disclosed herein correspond to genes 
contained in the cancerous cells (here, breast cancer) and are used to 

25 determine gene activity or expression because they represent the same 
sequence or are complementary to RNAs encoded by the gene. Such a gene 
also includes different alleles and splice variants that may occur in the cells 
used in the methods of the invention, such as where recombinant cells are 
used to assay for antineoplastic agents and such cells have been engineered 

30 to express a polynucleotide as disclosed herein, including cells that have 
been engineered to express such polynucleotides at a higher level than is 
found in non-engineered cancerous cells or where such recombinant cells 
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express such polynucleotides only after having been engineered to do so. 
Such engineering includes genetic engineering, such as where one or more of 
the polynucleotides disclosed herein has been inserted into the genome of 
such cell or is present in a vector. 

5 

Such cells, especially mammalian cells, may also be engineered to 
express on their surfaces one or more of the polypeptides of the invention for 
testing with antibodies or other agents capable of masking such polypeptides 
and thereby removing the cancerous nature of the cell. Such engineering 
1 0 includes both genetic engineering, where the genetic complement of the cells 
is engineered to express the polypeptide, as well as non-genetic engineering, 
whereby the cell has been physically manipulated to incorporate a polypeptide 
of the invention in its plasma membrane, such as by direct insertion using 
chemical and/or other agents to achieve this result 

15 

In accordance with the foregoing, the present invention includes anti- 
cancer agents that are themselves either polypeptides, or small chemical 
entities, that affect the cancerous process, including initiation, suppression or 
facilitation of tumor growth, either in vivo or ex vivo. Said cancer modulating 
20 agent will have the effect of decreasing gene expression. 

The present invention thus also relates to a method for treating cancer 
comprising contacting a cancerous cell with an agent having activity against 
an expression product encoded by a gene or polynucleotide sequence as 

25 disclosed herein, such as one having, or corresponding to, the nucleotide 
sequence of SEQ ID NO: 1, 2, 3 and 4. The present invention also relates to a 
process for treating cancer comprising contacting a cancerous cell with an 
agent having activity against an expression product encoded by a gene or 
polynucleotide sequence corresponding to a sequence selected from the 

30 group consisting of SEQ ID NO: 1, 2, 3 and 4. In one such embodiment, the 
cancerous cell is contacted in vivo. In another such embodiment, said agent 
has affinity for said expression product Ih a preferred embodiment, such 
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agent is an antibody disclosed herein, such as an antibody that is specific or 
selective for, or otherwise reacts with, a polypeptide of the invention. In a 
preferred embodiment, the expression product is a polypeptide incorporating 
an amino acid sequence selected from SEQ ID NO: 5 and 6. 

5 

The present invention is also directed to such uses of the compositions 
of polypeptides and antibodies disclosed herein. Such uses include a process 
for treating cancer in an animal afflicted therewith comprising administering to 
said animal an amount of an immunogenic composition of one or more of the 
10 polypeptides disclosed herein where such amount if an amount sufficient to 
elicit the production of cytotoxic T lymphocytes specific for a polypeptide of 
the invention, preferably a polypeptide incorporating a sequence of SEQ ID 
NO: 5 and 6. In a preferred embodiment, the animal to be so treated is a 
human patient 

15 

The proteins encoded by the genes disclosed herein due to their 
expression, or elevated expression, in cancer cells, represent highly useful 
therapeutic targets for "targeted therapies" utilizing such affinity structures as, 
for example, antibodies coupled to some cytotoxic agent. In such 

20 methodology, it is advantageous that nothing need be known about the 
endogenous ligands or binding partners for such cell surface molecules. 
Rather, an antibody or equivalent molecule that can specifically recognize the 
cell surface molecule (which could include an artificial peptide, a surrogate 
ligand, and the like) that is coupled to some agent that can induce cell death 

25 or a block in cell cycling offers therapeutic promise against these proteins. 
Thus, such approaches include the use of so-called suicide "bullets" against 
intracellular proteins. For example, monoclonal antibodies may readily by 
produced by methods well known in the art, for example, the method of Kohler 
and Milstein (see: Nature, 256:495 (1975). 

30 

With the advent of methods of molecular biology and recombinant 
technology, it is now possible to produce antibody molecules by recombinant 
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means and thereby generate gene sequences that code for specific amino 
acid sequences found in the polypeptide structure of the antibodies. Such 
antibodies can be produced by either cloning the gene sequences encoding 
the polypeptide chains of said antibodies or by direct synthesis of said 
5 polypeptide chains, with in vitro assembly of the synthesized chains to form 
active tetrameric (H 2 L 2 ) structures with affinity for specific epitopes and 
antigenic determinants. This has permitted the ready production of antibodies 
having sequences characteristic of neutralizing antibodies from different 
species and sources. 

10 Regardless of the source of the antibodies, or how they are 

recombinantly constructed, or how they are synthesized, in vitro or in vivo, 
using transgenic animals, such as cows, goats and sheep, using large cell 
cultures of laboratory or commercial size, in bioreactors or by direct chemical 
synthesis employing no living organisms at any stage of the process, all 

1 5 antibodies have a similar overall 3 dimensional structure. This structure is 
often given as H 2 L 2 and refers to the fact that antibodies commonly comprise 
2 light (L) amino acid chains and 2 heavy (H) amino acid chains. Both chains 
have regions capable of interacting with a structurally complementary 
antigenic target The regions interacting with the target are referred to as 

20 "variable" or "V" regions and are characterized by differences in amino acid 
sequence from antibodies of different antigenic specificity. 

The variable regions of either H or L chains contains the amino acid 
sequences capable of specifically binding to antigenic targets. Within these 
25 sequences are smaller sequences dubbed "hypervariable" because of their 
extreme variability between antibodies of differing specificity. Such 
hypervariable regions are also referred to as "complementarity determining 
regions" or "CDR" regions. These CDR regions account for the basic 
specificity of the antibody for a particular antigenic determinant structure. 

30 

The CDRs represent non-contiguous stretches of amino acids within 
the variable regions but, regardless of species, the positional locations of 
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these critical amino acid sequences within the variable heavy and light chain 
regions have been found to have similar locations within the amino acid 
sequences of the variable chains. The variable heavy and light chains of all 
antibodies each have 3 CDR regions, each non-contiguous with the others 
5 (termed L1, L2, L3, H1, H2, H3) for the respective light (L) and heavy (H) 
chains. The accepted CDR regions have been described by Kabat et al., J. 
Biol. Chem. 252:6609-6616(1977). 

In all mammalian species, antibody polypeptides contain constant (i.e., 
1 0 highly conserved) and variable regions, and, within the latter, there are the 
CDRs and the so-called "framework regions" made up of amino acid 
sequences within the variable region of the heavy or light chain but outside 
the CDRs. 

1 5 The antibodies disclosed according to the invention may also be wholly 

synthetic, wherein the polypeptide chains of the antibodies are synthesized 
and, possibly, optimized for binding to the polypeptides disclosed herein as 
being receptors. Such antibodies may be chimeric or humanized antibodies 
and may be fully tetrameric in structure, or may be dimeric and comprise only 

20 a single heavy and a single light chain. Such antibodies may also include 
fragments, such as Fab and F(ab 2 )' fragments, capable of reacting, with and 
binding to any of the polypeptides disclosed herein as being receptors. 

In one aspect, the present invention relates to immunoglobulins, or 
25 antibodies, as described herein, that react with, especially where they are 
specific for, the polypeptides having amino acid sequences as disclosed 
herein, preferably those having an amino acid sequence of one of SEQ ID 
NO: 5 and 6. Such antibodies may commonly be in the form of a composition, 
especially a pharmaceutical composition. Such antibodies, by themselves, 
30 may have therapeutic value in that they are able to bind to, and thereby tie up, 
surface sites on cancerous cells. Where such sites have some type of 
function to perform (i.e., where they are surface enzymes, or channel 
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structures, or structures that otherwise facilitate, actively or passively, the 
transport of nutrients and other vital materials to the cell. Such nutrients serve 
to facilitate the growth and replication of the cell and molecules that bind to 
such sites and thereby interfere with such activities can prove to have a 
5 therapeutic effect in that the result of such binding is to remove sources of 
nutrients from such cells, thereby interfering with growth and replication. In 
like manner, such binding may serve to remove vital enzyme activities from 
the cell's functional repertoire, thereby also interfering with viability and/or the 
ability of the cell to multiply or metastasize. In addition, by binding to such 

1 0 surface sites, the antibodies may serve to prevent the cells from reacting to 
environmental agents, such as cytokines and the like, that may facilitate 
growth, replication and metastasis, thereby further reducing the cancerous 
status of such cell and ameliorating the cancerous condition in a patient, even 
without proving fatal to the cell or cells so affected. 

1 5 The methods of the present invention also include processes wherein 

the cancer cell is contacted in vivo as well as ex vivo with an agent that 
comprises a portion, or is part of an overall molecular structure, having affinity 
for an expression product of a gene corresponding to a polynucleotide 
sequence as disclosed herein, preferably where the expression product is a 

20 cell surface structure, most preferably a polypeptide as disclosed herein, such 
as one that comprises an amino acid sequence of SEQ ID NO: 5 and 6. In 
one such embodiment, said portion having affinity for said expression product 
is an antibody, especially where said expression product is a polypeptide or 
oligopeptide or comprises an oligopeptide portion, or comprises a polypeptide. 

25 

In another aspect, the present invention also relates to an antibody that 
reacts with a polypeptide as disclosed herein, preferably a polypeptide 
comprising an amino acid sequence selected from the group consisting of 
SEQ ID NO: 5 and 6. Such an antibody may be polyclonal, monoclonal, 
30 recombinant or synthetic in origin. In one such embodiment, said antibody is 
associated, either covalently or non-covalently, with a cytotoxic agent, for 
example, an apoptotic agent. It is thus contemplated that the antibody acts a 
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targeted vector for guiding an associated therapeutic agent to a cancerous 
cell, such as a cell expressing a polypeptide homologous to, if not identical to, 
a polypeptide as disclosed herein. 

5 Where the cytotoxic agent is itself a polypeptide, said may be linked 

directly to an antibody specific for a surface target on a cancer cell, such as 
where the polypeptide represents an extension of the amino acid chain of the 
antibody. In alternative embodiments, such molecules may be covalently 
linked through a linker sequence of long or short duration, such as an amino 

10 acid sequence of 5 to 10 residues in length. Where the cytotoxic agents is 
some small organic molecule, such as a small organic compound, or some 
type of apoptotic agent, this may be covalently bonded to the antibody 
molecule or may be attached by some other type of non-covalent linkage, 
including hydrophobic and electrostatic linkages. Methods for forming such 

1 5 linkages, especially covalent linkages, are well known to those skilled in the 
art. 

The antibodies disclosed herein may also serve as targeting vectors for 
much larger structures, such as liposomes. In one such embodiment, an 

20 antibody is part of, or otherwise linked to, or associated with, a membranous 
structure, preferably a liposome or possibly some type of cellular organelle, 
which acts as a reservoir for a cytotoxic agent, such as ricin. The antibody 
then acts to target said liposome to a cancerous tissue in an animal, 
whereupon the liposome provides a source of cytotoxic agents for localized 

25 treatment of a solid tumor or other type of neoplasm. 

The present invention further encompasses an immunogenic 
composition comprising a polypeptide disclosed herein, as well as 
compositions formed using antibodies specific for these polypeptides. 

30 

Methods well known in the art for making formulations are found in, for 
example, Remington: The Science and Practice of Pharmacy, (19th ed.) Ed. 
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A.R. Gennaro, 1995, Mack Publishing Company, Easton, PA. Formulations 
for parenteral administration may, for example, contain excipients, sterile 
water, or saline, polyalkylene glycols such as polyethylene glycol, oils of 
vegetable origin, or hydrogenated napthalenes. Biocompatible, biodegradable 
5 lactide polymer, lactide/glycolide copolymer, or polyoxyethylene- 
polyoxypropylene copolymers may be used to control the release of the 
compounds. Other potentially useful parenteral delivery systems for agonists 
of the invention include ethylenevinyl acetate copolymer particles, osmotic 
pumps, implantable infusion systems, and liposomes. Formulations for 

10 inhalation may contain excipients, or example, lactose, or may be aqueous 
solutions containing, for example, polyoxyethylene-9-lauryl ether, glycocholate 
and deoxycholate, or may be oily solutions for administration in the form of 
nasal drops, or as a gel. It should be noted that, where the therapeutic agent 
to be administered is an immunoconjugate, these sometimes contain 

1 5 chemical linkages that are somewhat labile in aqueous media and therefor 
must be stored prior to administration is a more stable environment, such as 
in the form of a lyophilized powder. 

Such an agent can be a single molecular structure, comprising both 
20 affinity portion and anti-cancer activity portions, wherein said portions are 
derived from separate molecules, or molecular structures, possessing such 
activity when separated and wherein such agent has been formed by 
combining said portions into one larger molecular structure, such as where 
said portions are combined into the form of an adduct. Said anti-cancer and 
25 affinity portions may be joined covalently, such as in the form of a single 
polypeptide, or polypeptide-like, structure or may be joined non-covalently, 
such as by hydrophobic or electrostatic interactions, such structures having 
been formed by means well known in the chemical arts. Alternatively, the anti- 
cancer and affinity portions may be formed from separate domains of a single 
30 molecule that exhibits, as part of the same chemical structure, more than one 
activity wherein one of the activities is against cancer cells, or tumor formation 
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or growth, and the other activity is affinity for an expression product produced 
by expression of genes related to the cancerous process or condition. 

In one embodiment of the present invention, a chemical agent, such as 
5 a protein or other polypeptide, is joined to an agent, such as an antibody, 
having affinity for an expression product of a cancerous cell, such as a 
polypeptide or protein encoded by a gene related to the cancerous process, 
preferably a gene as disclosed herein according to the present invention, 
most preferably a polypeptide sequence disclosed herein. Thus, where the 

10 presence of said expression product is essential to tumor initiation and/or 
growth, binding of said agent to said expression product will have the effect of 
negating said tumor promoting activity. In one such embodiment, said agent is 
an apoptosis-inducing agent that induces cell suicide, thereby killing the 
cancer cell and halting tumor growth. 

15 Other genes within the cancer cell that are regulated in a manner 

similar to that of the genes disclosed herein and thus change their expression 
in a coordinated way in response to chemical compounds represent genes 
that are located within a common metabolic, signaling, physiological, or 
functional pathway so that by analyzing and identifying such commonly 

20 regulated groups of genes (groups that include the gene, or similar 
sequences, disclosed according to the invention, one can (a) assign known 
genes and novel genes to specific pathways and (b) identify specific functions 
and functional roles for novel genes that are grouped into pathways with 
genes for which their functions are already characterized or described. For 

25 example, one might identify a group of 10 genes, at least one of which is the 
gene as disclosed herein, that change expression in a coordinated fashion 
and for which the function of one, such as the polypeptide encoded by the 
sequence disclosed herein, is known then the other genes are thereby 
implicated in a similar function or pathway and may thus play a role in the 

30 cancer-initiating or cancer-facilitating process. In the same way, if a gene 
were found in normal cells but not in cancer cells, or happens to be expressed 
at a higher level in normal as opposed to cancer cells, then a similar 
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conclusion may be drawn as to its involvement in cancer, or other diseases. 
Therefore, the processes disclosed according to the present invention at once 
provide a novel means of assigning function to genes, i.e. a novel method of 
functional genomics, and a means for identifying chemical compounds that 
5 have potential therapeutic effects on specific cellular pathways. Such 
chemical compounds may have therapeutic relevance to a variety of diseases 
outside of cancer as well, in cases where such diseases are known or are 
demonstrated to involve the specific cellular pathway that is affected. 

10 The polypeptides disclosed herein, preferably those of SEQ ID NO: 5 

and 6, also find use as vaccines in that, where the polypeptide represents a 
surface protein present on a cancer cell, such polypeptide may be 
administered to an animal, especially a human being, for purposes of 
activating cytotoxic T lymphocytes (CTLs) that will be specific for, and act to 

1 5 lyze, cancer cells in said animal. Where used as vaccines, such polypeptides 
are present in the form of a pharmaceutical composition. The present 
invention may also employ polypeptides that have the same, or similar, 
immunogenic character as the polypeptides of SEQ ID NO: 5 and 6 and 
thereby elicit the same, or similar, immunogenic response after administration 

20 to an animal, such as an animal at risk of developing cancer, or afflicted 
therewith. Thus, the polypeptides disclosed according to the invention will 
commonly find use as immunogenic compositions. 

Expression of a gene corresponding to a polynucleotide disclosed 
25 herein, when in normal tissues, may indicate a predisposition towards 
development of breast cancer. The encoded polypeptide might then present a 
potentially useful cell surface target for therapeutic molecules such as 
cytolytic antibodies, or antibodies attached to cytotoxic, or cytolytic, agents. . 

30 The present invention specifically contemplates use of antibodies 

against the polypeptides encoded by the polynucleotides corresponding to the 
genes disclosed herein, whereby said antibodies are conjugates to one or 
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more cytotoxic agents so that the antibodies serve to target the conjugated 
immunotoxins to a region of cancerous activity, such as a solid tumor. For 
many known cytotoxic agents, lack of selectivity has presented a drawback to 
their use as therapeutic agents in the treatment of malignancies. For example, 
5 the class of two-chain toxins, consisting of a binding subunit (or B-chain) 
linked to a toxic subunit (A-chain) are extremely cytotoxic. Thus, such agents 
as ricin, a protein isolated from castor beans, kills cells at very low 
concentrations (even less than 10" 11 M) by inactivating ribosomes in said cells 
(see, for example, Lord et al., Ricin: structure, mode of action, and some 

10 current applications. Faseb J, 8; 201-208 (1994), and Blattler et al., Realizing 
the full potential of immunotoxins. Cancer Cells, 1. 50-55 (1989)). While 
isolated A-chains of protein toxins that functionally resemble ricin A-chain are 
only weakly cytotoxic for intact cells (in the concentration range of 10' 7 to 10~ 6 
M), they are very potent cytotoxic agents inside the cells. Thus, a single 

1 5 molecule of the A-subunit of diphtheria toxin can kill a cell once inside (see: 
Yamaizumi et al., One molecule of diphtheria toxin fragment A introduced into 
a cell can kill the cell. Cell, 15: 245-250, 1978). 

The present invention solves this selectivity problem by using 
antibodies specific for antigens present on cancer cells to target the cytotoxins 

20 to said cells. In addition, use of antibodies decreases toxicity because:the 
antibodies are non-toxic until they reach the tumor and, because the cytotoxin 
is bound to the antibody, it is presented with less opportunity to cause 
damage to non-targeted tissues. 

25 In addition, use of such antibodies alone can provide therapeutic 

effects on the tumor through the antibody-dependent cellular cytotoxic 
response (ADCC) and complement-mediated cell lysis mechanisms. 

A number of recombinant immunotoxins (for example, consisting of Fv 
30 regions of cancer specific antibodies fused to truncated bacterial toxins) are 
well known (see, for example, Smyth et al., Specific targeting of chlorambucil 
to tumors with the use of monoclonal antibodies, J. Natl. Cancer Inst, 
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76(3):503-510 (1986); Cho et al. v Single-chain Fv/folate conjugates mediate 
efficient lysis of folate-receptor-positive tumor cells, Bioconjug. Chem., 
8(3):338-346 (1997)). As noted in the literature, these may contain, for 
example, a truncated version of Pseudomonas exotoxin as a toxic moiety but 
5 the toxin is modified in such a manner that by itself it does not bind to normal 
human cells, but it retains all other functions of cytotoxicity. Here, recombinant 
antibody fragments target the modified toxin to cancer cells which are killed, 
such as by direct inhibition of protein synthesis, or by concomitant induction of 
apoptosis. Cells that are not recognized by the antibody fragment, because 

10 they do not carry the cancer antigen, are not affected. Good activity and 
specificity has been observed for many recombinant immunotoxins in in vitro 
assays using cultured cancer cells as well as in animal tumor models. 
Ongoing clinical trials provide examples where the promising pre-clinical data 
correlate with successful results in experimental cancer therapy, (see, for 

1 5 example, Brinkmann U., Recombinant antibody fragments and immunotoxin 
fusions for cancer therapy, In Vivo (2000) 14:21-27). 

While the safety of employing immunoconjugates in humans has been 
established, in vivo therapeutic results have been less impressive. Because 

20 clinical use of mouse MAbs in humans is limited by the development of a 
foreign anti-globulin immune response by the human host, genetically 
engineered chimeric human-mouse MAbs have been developed by replacing 
the mouse Fc region with the human constant region. In other cases, the 
mouse antibodies have been "humanized" by replacing the framework regions 

25 of variable domains of rodent antibodies by their human equivalents. Such 
humanized and engineered antibodies can even be structurally arranged to 
have specificities and effector functions determined by design and which 
characteristics do not appear in nature. The development of bispecific 
antibodies, having different binding ends so that more than one antigenic site 

30 can be bound, have proven useful in targeting cancer cells. Thus, such 
antibody specificity has been improved by chemical coupling to various 
agents such as bacterial or plant toxins, radionuclides or cytotoxic drugs and 
other agents, (see, for example, Bodey, B. et al). Genetically engineered 
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monoclonal antibodies for direct antineoplastic treatment and cancer cell 
specific delivery of chemotherapeutic agents. Curr Pharm Des (2000) 
Feb;6(3):261-76). See also, Garnett, M. C, Targeted drug conjugates: 
principles and progress. Adv. Drug Deliv. Rev. (2001 Dec 17) 53(2):171-216; 
5 Brinkmann et al., Recombinant immunotoxins for cancer therapy. Expert Opin 
Biol Then (2001 ) 1 (4):693-702. 

Among the cytotoxic agents specifically contemplated for use as 
immunoconjugates according to the present invention are Calicheamicin, a 

10 highly toxic enediyne antibiotic isolated from Micromonospora 
echinospora ssp. Ca/ichensis, and which binds to the minor groove of 
DNA to induce double strand breaks and cell death (see: Lee et al., 
Calicheamicins, a novel family of antitumor antibiotics. 1. Chemistry and 
partial structure of calichemicin gi. J Am Chem Soc, 109; 3464-3466 (1987); 

15 Zein et al., Calicheamicin gamma 11: an antitumor antibiotic that cleaves 
double-stranded DNA site specifically, Science, 240; 1198-1201 (1988)). 
Useful derivatives of the calicheamicins include mylotarg and 138H11-Cam9. 
Mylotarg is an immunoconjugate of a humanized anti-CD33 antibody (CD33 
being found in leukemic cells of most patients with acute myeloid leukemia) 

20 and N-acetyl gamma colicheamicin dimethyl hydrazide, the latter of which is 
readily coupled to an antibody of the present invention (in place of the anti- 
CD33 but which can also be humanized by substitution of human framework 
regions into the antibody during production as described elsewhere herein) to 
form an immunoconjugate of the invention, (see: Hamann et al. Gemtuzumab 

25 Ozogamicin, A Potent and Selective Anti-CD33 Antibody-Calicheamicin 
Conjugate for Treatment of Acute Myeloid Leukemia, Bioconjug. Chem. 13, 
47-58 (2002)) For use with 138H11-Came, 138H11 is an anti-y-glutamyl 
transferase antibody coupled to theta calicheamicin through a disulfide 
linkage and found useful in vitro against cultured renal cell carcinoma cells. 

30 (see: Knoll et al., Targeted therapy of experimental renal cell carcinoma with a 
novel conjugate of monoclonal antibody 138H11 and calicheamicin eV, 
Cancer Res, 60; 6089-6094 (2000) The same linkage may be utilized to link 
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this cytotoxic agent to an antibody of the present invention, thereby forming a 
targeting structure for breast cancer cells. 

Also useful in forming the immunoconjugates of the invention is DC1 , a 
5 disulfide-containing analog of adozelesin, that kills cells by binding to the 
minor groove of DNA, followed by alkylation of adenine bases. Adozelesin is a 
structural analog of CC-1065, an anti-tumor antibiotic isolated from microbial 
fermentation of Streptomyces zelensis, and is about 1 ,000 fold more toxic to 
cultured cell lines that other DNA interacting agents, such as cis-platin and 
1 0 doxorubicin. This agent is readily linked to antibodies through the disulfide 
bond of adozelesin. (see: Chari et al., Enhancement of the selectivity and 
antitumor efficacy of a CC-1065 analogue through immunoconjugate 
formation, Cancer Res, 55:4079-4084(1995)). 

1 5 Maytansine, a highly cytotoxic microtubular inhibitor isolated from the 

shrub Maytenus serrata found to have little value in human clinical trials, is 
much more effective in its derivatized form, denoted DM1, containing a 
disulfide bond to facilitate linkage to antibodies, is up to 10-fold more cytotoxic 
(see: Chari et al., Immunoconjugates containing novel maytansinoids: 

20 promising anticancer drugs, Cancer Res, 52; 127-131 (1992)). These same in 
vitro studies showed that up to four DM1 molecules could be linked to a single 
immunoglobulin without destroying the binding affinity. Such conjugates have 
been used against breast cancer antigens, such as the neu/HER2JerbB-2 
antigen, (see: Goldmacher et al., Immunogen, Inc., (2002) in press); also see 

25 Liu, C. et al., Eradication of large colon tumor xenografts by targeted delivery 
of maytansinoids, Proc. Natl. Acad. Sci. USA, 93, 8618-8623 (1996)). For 
example, Liu et al. (1996) describes formation of an immunoconjugate of the 
maytansinoid cytotoxin DM1 and C242 antibody, a murine lgG1 
immunoglobulin, available from Pharmacia and which has affinity for a mucin- 

30 like glycoprotein variably expressed by human colorectal cancers. The latter 
immunoconjugate was prepared according to Chari et al., Cancer Res., 
52:127-131 (1992) and was found to be highly cytotoxic against cultured colon 
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cancer cells as well as showing anti-tumor effects in vivo in mice bearing 
subcutaneous COLO 205 human colon tumor xenografts using doses well 
below the maximum tolerated dose. 

5 In addition, there are a variety of protein toxins (cytotoxic proteins), 

which include a number of different classes, such as those that inhibit protein 
synthesis: ribosome-inactivating proteins of plant origin, such as ricin, abrin, 
gelonin, and a number of others, and bacterial toxins such as pseudomonas ^ 
exotoxin and diphtheria toxin. 

10 

Another useful class is the one including taxol, taxotere, and taxoids. 
Specific examples include paclitaxel (taxol), its analog docetaxel (taxotere), 
and derivatives thereof. The first two are clinical drugs used in treating a 
number of tumors while the taxoids act to induce cell death by inhibiting the 
1 5 de-polymerization of tubulin. Such agents are readily linked to antibodies 
through disulfide bonds without disadvantageous effects on binding 
specificity. 

In one instance, a truncated Pseudomonas exotoxin was fused to an 
20 anti-CD22 variable fragment and used successfully to treat patients with 
chemotherapy-resistant hairy-cell leukemia, (see: Kreitman et al., Efficacy of 
the anti-CD22 recombinant immunotoxin BL22 in chemotherapy-resistant 
hairy-cell leukemia, N Engl J Med, 345; 241-247 (2001)) Conversely, the 
cancer-linked peptides of the present invention offer the opportunity to 
25 prepare antibodies, recombinant or otherwise, against the appropriate 
antigens to target solid tumors, preferably those of malignancies of breast 
tissue, using the same or similar cytotoxic conjugates. Thus, many of the 
previously used immunoconjugates have been formed using antibodies 
against general antigenic sites linked to cancers whereas the antibodies 
30 formed using the peptides disclosed herein are more specific and target the 
antibody-cytotoxic agent to a particular tissue or organ, thus further reducing 
toxicity and other undesirable side effects. 
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In accordance with the foregoing, a preferred embodiment includes 
where the cytotoxic agent is a calicheamicin, a maytansinoid, an 
adozelesin, DC1, a cytotoxic protein, a taxol, a taxotere, or a taxoid. In 
especially preferred embodiments, the calicheamicin is calicheamicin y^, 
5 N-acetyl gamma calicheamicin dimethyl hydrazide or calicheamicin 6/, the 
maytansinoid is DM1, the cytotoxic protein is ricin, abrin, gelonin, 
pseudomonas exotoxin or diphtheria toxin, the taxol is paclitaxel, and the 
taxotere is docetaxel. . 

10 In addition, the immunoconjugates formed using the antibodies 

prepared against the cancer-linked antigens disclosed herein can be formed 
by any type of chemical coupling. Thus, the cytotoxic agent of choice, along 
with the immunoglobulin, can be coupled by any type of chemical linkage, 
covalent or non-covalent, including electrostatic linkage, to form the 

1 5 immunoconjugates of the present invention. 

When used as immunoconjugates, the antitumor agents of the present 
invention represent a class of pro-drugs that are relatively non-toxic when first 
administered to an animal (due mostly to the stability of the 

20 immunoconjugate), such as a human patient, but which are targeted by the 
conjugated immunoglobulin to a cancer cell where they then exhibit good 
toxicity. The tumor-related, associated, or linked, antigens, preferably those 
presented herein, serve as targets for the antibodies (monoclonal, 
recombinant, and the like) specific for said antigens. The end result is the 

25 release of active cytotoxic agent inside the cell after binding of the 
immunoglobulin portion of the immunoconjugate. 

The cited references describe a number of useful procedures for the 
chemical linkage of cytotoxic agents to immunoglobulins and the disclosures 
30 of all such references cited herein are hereby incorporated by reference in 
their entirety. For other reviews see Ghetie et al., Immunotoxins in the therapy 
of cancer: from bench to clinic, Pharmacol Ther, 63: 209-234 (1994), Pietersz 
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et al. The use of monoclonal antibody immunoconjugates in cancer therapy, 
Adv Exp Med Biol, 353:169-179 (1994), and Pietersz, G. A. The linkage of 
cytotoxic drugs to monoclonal antibodies for the treatment of cancer, 
Bioconjug Chem, 1:89-95 (1990). 

5 

Thus, the present invention provides highly useful cancer-associated 
antigens for generation of antibodies for linkage to a number of different 
cytotoxic agents which are already known to have some in vitro toxicity and 
possess chemical groups available for linkage to antibodies. 

10 

The present invention also relates to a process that comprises a 
method for producing a product, including generating test data, comprising 
identifying an agent according to one of the disclosed processes for 
identifying such an agent (i.e., the therapeutic agents identified according to 

15 the assay procedures disclosed herein) wherein said product is the data 
collected with respect to said agent as a result of said identification process, 
or assay, and wherein said data is sufficient to convey the chemical character 
and/or structure and/or properties of said agent. For example, the present 
invention specifically contemplates a situation whereby a user of an assay of 

20 the invention may use the assay to screen for compounds having the desired 
enzyme modulating activity and, having identified the compound, then 
conveys that information (i.e., information as to structure, dosage, etc) to 
another user who then utilizes the information to reproduce the agent and 
administer it for therapeutic or research purposes according to the invention. 

25 For example, the user of the assay (user 1) may screen a number of test 
compounds without knowing the structure or identity of the compounds (such 
as where a number of code numbers are used the first user is simply given 
samples labeled with said code numbers) and, after performing the screening 
process, using one or more assay processes of the present invention, then 

30 imparts to a second user (user 2), verbally or in writing or some equivalent 
fashion, sufficient information to identify the compounds having a particular 
modulating activity (for example, the code number with the corresponding 
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results). This transmission of information from user 1 to user 2 is specifically 
contemplated by the present invention. 

It should be cautioned that, in carrying out the procedures of the 
5 present invention as disclosed herein, whether to form immunoconjugates or 
screen for other antitumor agents using the genes and polypeptides disclosed 
herein, any reference to particular buffers, media, reagents, cells, culture 
conditions and the like are not intended to be limiting, but are to be read so as 
to include all related materials that one of ordinary skill in the art would 

1 0 recognize as being of interest or value in the particular context in which that 
discussion is presented. For example, it is often possible to substitute one 
buffer system or culture medium for another and still achieve similar, if not 
identical, results. Those of skill in the art will have sufficient knowledge of 
such systems and methodologies so as to be able, without undue 

15 experimentation, to make such substitutions as will optimally serve their 
purposes in using the methods and procedures disclosed herein. 

The present invention will now be further described by way of the 
following non-limiting example. In applying the disclosure of the example, it 
20 should be kept clearly in mind that other and different embodiments of the 
methods disclosed according to the present invention will no doubt suggest 
themselves to those of skill in the relevant art. The following example shows 
how a potential antineoplastic agent may be identified using one or more of 
the genes disclosed herein. 

25 

EXAMPLE 

Determination of Gene Inhibitory Activity of an Antineoplastic Agent 

30 

SW480 cells are grown to a density of 10 5 cells/cm 2 in Leibovitz's L-15 
medium supplemented with 2 mM L-glutamine (90%) and 10% fetal bovine 
serum. The cells are collected after treatment with 0.25% trypsin, 0.02% 
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EDTA at 37°C for 2 to 5 minutes. The trypsinized cells are then diluted with 30 
ml growth medium and plated at a density of 50,000 cells per well in a 96 well 
plate (100 ^il/well). The following day, cells are treated with either compound 
buffer alone, or compound buffer containing a chemical agent to be tested, for 
5 24 hours. The media is then removed, the cells lysed and the RNA recovered 
using the RNAeasy reagents and protocol obtained from Qiagen. RNA is 
quantitated and 10 ng of sample in 1 \x\ are added to 24 ]i\ of Taqman reaction 
mix containing 1X PCR buffer, RNAsin, reverse transcriptase, nucleoside 
triphosphates, amplitaq gold, Tween 20, glycerol, bovine serum albumin 

10 (BSA) and specific PCR primers and probes for a reference gene (18S RNA) 
and a test gene (Gene X). Reverse transcription is then carried out at 48°C for 
30 minutes. The sample is then applied to a Perkin Elmer 7700 sequence 
detector and heat denatured for 10 minutes at 95°C. Amplification is 
performed through 40 cycles using 15 seconds annealing at 60°C followed by 

15 a 60 second extension at 72°C and 30 second denaturation at 95°C. Data 
files are then captured and the data analyzed with the appropriate baseline 
windows and thresholds. 

The quantitative difference between the target and reference gene is 
20 then calculated and a relative expression value determined for all of-the 
samples used. In this way, the ability of a chemotherapeutic agent to 
effectively and selectively reduce the activity of a cancer-specific gene is 
readily ascertained. The overall expression of the cancer-specific gene, as 
modulated by one chemical agent relative to another, is also determined. 
25 Chemical agents having the most effect in reducing gene activity are thereby 
identified as the most antineoplastic. 

30 
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WHAT IS CLAIMED IS: 

1 . A process for identifying an agent that modulates the activity of a 
cancer-related gene comprising: 
5 (a) contacting a compound with a cell containing a gene that 

corresponds to a polynucleotide having a sequence selected from the group 
consisting of SEQ ID NO: 1, 2, 3 and 4 and under conditions promoting the 
expression of said gene; and 

(b) detecting a difference in expression of said gene relative to when 
1 0 said compound is not present 

thereby identifying an agent that modulates the activity of a cancer- 
related gene. 

2 The process of claim 1 wherein said gene has a sequence selected 
1 5 from the group consisting of SEQ ID NO: 1, 2, 3 and 4. 

3. The process of claim 1 wherein the cell is a cancer cell and the 
difference in expression is a decrease in expression. 

20 4. The process of claim 3 wherein said cancer cell is a member 

selected from breast cancer, Wilms tumor and soft tissue fibromatosis. 

5. A process for identifying an antineoplastic agent comprising 
contacting a cell exhibiting neoplastic activity with a compound first identified 

25 as a cancer related gene modulator using the process of claim 1 and 
detecting a decrease in said neoplastic activity after said contacting compared 
to when said contacting does not occur. 

6. The process of claim 5 wherein said neoplastic activity is 
30 accelerated cellular replication. 
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7. The process of claim 5 wherein said decrease in neoplastic activity 
results from the death of the cell. 

8. A process for identifying an anti-neoplastic agent comprising 
5 administering to an animal exhibiting a cancer condition an effective amount 

of an agent first identified according to the process of claim 1 and detecting a 
decrease in said cancerous condition. 

9. A process for determining the cancerous status of a cell, comprising 
1 0 determining an increase in the level of expression in said cell of a gene that 

corresponds to a polynucleotide having a sequence selected from the group 
consisting of SEQ ID NO: 1, 2, 3 and 4 wherein an elevated expression 
relative to a known non-cancerous cell indicates a cancerous state or 
potentially cancerous state. 

15 

10. The process of claim 9 wherein said elevated expression is due to 
an increased copy number. 

11. An isolated polypeptide comprising an amino acid sequence 
20 homologous to an amino acid sequence selected from the group consisting of 

SEQ ID NO: 5 and 6 wherein any difference between said amino acid 
sequence and the sequence of SEQ ID NO: 5 and 6 is due solely to 
conservative amino acid substitutions and wherein said isolated polypeptide 
comprises at least one immunogenic fragment. 

25 

12. An isolated polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO: 5 and 6. 

13. An antibody that reacts with a polypeptide comprising an amino 
30 acid sequence selected from the group consisting of SEQ ID NO: 5 and 6. 
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14. The antibody of claim 13 wherein said antibody is a recombinant 
antibody. 

15. The antibody of claim 13 wherein said antibody is a synthetic 
5 antibody. 

16. The antibody of claim 13 wherein said antibody is a humanized 
antibody. 

10 17. An immunoconjugate comprising the antibody of claim 13 and a 

cytotoxic agent. 

18. The antibody of claim 17 wherein said cytotoxic agent is a member 
selected from the group consisting of a calicheamicin, a maytansinoid, an 

1 5 adozelesin, a cytotoxic protein, a taxol, a taxotere, a taxoid and DC1 . 

19. The immunoconjugate of claim 18 wherein said calicheamicin is 
calicheamicin y/, N-acetyl gamma calicheamicin dimethyl hydrazide or 
calicheamicin Oi 1 . 

20 

20. The immunoconjugate of claim 18 wherein said maytansinoid is 

DM1. 

21. The immunoconjugate of claim 18 wherein said cytotoxic protein is 
25 ricin, abrin, gelonin, pseudomonas exotoxin or diphtheria toxin. 

22. The immunoconjugate of claim 18 wherein said taxol is paclitaxel. 

23. The immunoconjugate of claim 18 wherein said taxotere is 
30 docetaxel. 
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24. A process for treating cancer comprising contacting a cancerous 
cell in vivo with an agent having activity against an expression product 
encoded by a gene sequence selected from the group consisting of SEQ ID 
NO: 1,2, 3 and 4. 

5 

25. The process of claim 24 wherein said agent is an antibody of claim 

13. 

26. The process of claim 24 wherein said agent is an immunoconjugate 
1 0 of claim 17. 

27. An immunogenic composition comprising a polypeptide of claim 11. 

28. An immunogenic composition comprising a polypeptide of claim 12. 

15 

29. The process of claim 24 wherein said cancer is a member selected 
from breast cancer, Wilms tumor and soft tissue fibromatosis. 

30. A process for treating cancer in an animal afflicted therewith 
20 comprising administering to said animal an amount of an immunogenic 

composition of claim 27 sufficient to elicit the production of cytotoxic T 
lymphocytes specific for the polypeptide of claim 1 1 . 

31. A process for treating cancer in an animal afflicted therewith 
25 comprising administering to said animal an amount of an immunogenic 

composition of claim 28 sufficient to elicit the production of cytotoxic T 
lymphocytes specific for the polypeptide of claim 12. 

32. A process for treating a cancerous condition in an animal afflicted 
30 therewith comprising administering to said animal a therapeutically effective 

amount of an agent first identified as having antineoplastic activity using the 
process of claim 8. 
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33. A process for protecting an animal against cancer comprising 
administering to an animal at risk of developing cancer a therapeutically 
effective amount of an agent first identified as having antineoplastic activity 
using the process of claim 8. 

5 

34. The process of claim 30 wherein said animal is a human being. 

35. The process of claim 30 wherein said cancer is a member selected 
from breast cancer, Wilms tumor and soft tissue fibromatosis. 

10 

36. A method for producing test data with respect to the gene 
modulating activity of a compound comprising: 

(a) contacting a compound with a cell containing a polynucleotide 
comprising a nucleotide sequence corresponding to a gene whose expression 

15 is increased in a cancerous cell over that in a non-cancerous cell and under 
conditions wherein said polynucleotide is being expressed, 

(b) determining a change in expression of polynucleotides as a result of 
said contacting, and 

(c) producing test data with respect to the gene modulating activity of 
20 said compound based on a decrease in the expression of the determined 

gene whose expression is otherwise increased in a cancerous cell over that in 
a non-cancerous cell indicating gene modulating activity. 
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SEQUENCE LISTING 



<110> Avalon Pharmaceuticals 

<120> Cancer-Linked Gene as Target for Chemotherapy 

<130> 689290-159 

<140> 
<141> 

<150> US/60/388,075 

<151> 2002-06-11 

<160> 6 

<170> Patentln version 3.0 



<210> 1 

<211> 6843 

<212> DNA 

<213> Homo sapiens 

<400> 1 

gttgcagtga gccgagatcg crccactgca ctccagcctg ggcgacrgag cgagactttg 
tytcaagatt actctaaaca gaaacaagag tgaaataaat ttgtggagtt aataataaaa 
atgttattct ctgtggtatg ttttcttttc ttttgttttt ttrrgacrgg gtcttgctct 
gttgcccagg ctggagtgca gtggcgccat cttggctcac tgcrrcctct gcctcccrgg 
ttcragcgat tctcctgcct crggcccctg agtrgctggg attacaggca cgtrccacca 
cgcctagcta atttttatat ttttagtaga gacrgggttt crccatgttg gtcatgttgg 
tctcgaactc ctgacctcat gatctgcccc ctgccrgcct cccacagtgc tgggattrca 
ggcgtgagcc accgcgcccg gcctgtattt tttcatttca tagaatttcc agataaaagt 
cagagtcaac tagttttata atactgaaaa gttttgcaaa tacaagcatg tcacttaatg 
atggggatgt gttctgagaa atgtgtcatt aggtgattcc gtcattgtgc aaatgtcata 
gagtatactt acataaacct aggtggtatg tagcctatat gatatatgtg gcttatatga 
tataccctat tgctcctagg ctgcaaacct gtatagcatg ttactatcct gaagtcaatg 
ggcaattgta acacaatggc aagtatttgt gtattgaaac ataagggtat agtaaaaata 
tggtactatt aatacaactg tatgggacta ccattgtata tgcggtccat cattggccaa 
actttcttta tggtggcaca agactacata ataaatggta aagtaaccag acttggagat 
ggggaggagg gagaagaatg ccaaatgtat gtttcctccc acagactagc aggcatctag 
tatcaggcaa cayacttata caatcatttc ctactcctgt ctcacacttg ccagaatctg 
gatttaattt ttttttttta ttgttagacc agtaatgcag ttttcatatt ttgtataaca 
aaaaagttgt aggacttttc cttcttactg tgaaaatatg actcattaaa tacataactt 
ttagtattgg ccatttgtat tcttatactc ttagtcacaa agataactac gttttacaaa 
gtcgaaaggt taataatagc agctagcatg aattcagagc ttgctatgta ttgagcactg 
ggtttaaaca catatgctct attttattaa gtccttatag taggaaagtg aagaatacac 
tagtcactga agttccttca gtgctcaaga ctcaagtgca ctatagcaag acaaacacca 
ggaaacatgt aactctgctt cgaatacttt agcaagtgaa tggcccagtt tgggaaagca 
agacaaacac tggatccata atggcttttt acacctgatt ggaatgcaga aaaaatggaa 
ccactaacta ctaatattat ctttgccagc aggtgagcag tacatacctg acttcagagg 
attttattga aactcaatag atgatggtta ataaaaacaa atatcctcag ctttccttct 
tttacttgtc atctgacaac cactatttag ctaaaactaa aatggattag agatacaaga 
tgatgaataa caattattaa cttttgtgag catataatgt gagtcagcta ctacactaag 
aactttacat agattccctt aattttctca ataatcctat gagaactgtc cttgtatgag 
caaccccatt ttatagctga agatacttga ggcttcaaga ggttatataa acatgttcat 
ggtcaacagt tattaagcag tgagtgacat tctcacgaga ttaatgagat catctgtgtg 
agatacaaat acatctattg tgtatgtttg tgtttttaat ggagaccaat ggtatgacag 
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aatgaagtcc ttttatcagt cagtcaagat ctgatgtggc tattcaaata taggcaaaac 204 0 

tgatgttttc cttttttagt taagttaaat gtactattta aataactgtg aatatttaga 2100 

atattcacag aattgtgcaa ccatcaccac aatctttaga atattttcat catctcaaaa 2160 

agatactttg tagcctttag cacccttccc actgtacccc cccccatacc cctggccata 2220 

ggcaactact aaattacttt ctgtccctgc aggttgtcct attctggata ctttgtatat 2280 

gtggaatcat aaaataggtg atcttttatg actggcatct ttcaagcatg tttttgatat 2340 

tttcatccat gttgtaggat attaaattaa ctttttgcat aggaataact gagctagaaa 2400 

agcatgggct gaatatgtct gcagaatgta gcatttagaa aagaaagact tccaggagac 2 4 60 

aaatgaggta gtaactaaaa tgtaaataac taaaaattag ctttgagtct ttaataaact 2520 

acctttcatg ttttgctttc actaggaatg aacttttaat tttttggagc tataaataca 2580 

tgtagtttct aaagacatct ttgtttttaa agaaaagaca tctgcaggct ttttcttagt 2640 

gttcctatta agtatgtatg agtgtgtgca tttgtgtgtg tgtgtgtgca tgtttagtga 2700 

ctttacatgt gtatacatac aaatgtacac agtcatatat gactagtggt acatggaatc 27 60 

tcaatttaat tgtaaaagtg aaggagttct aaaactagtt ttagtatagt aagtgggata 2820 

tagtaagaat gtcctacaat ttaaaagctt atttggttat cttcaatgga ttttgaatgt 2880 

attcaatact agatttgcga gtgactatgc ctttttagta aacaaaagaa aggctataat 294 0 

gtagaaagta tatggaataa gcataacatc tgtcctaatc taattttcag gcacatctcc 3000 

agacttcaat ttggctgaaa taattcatgc cacggacctg tgcacatgcc tggaattgag 3060 

agacacagtt aaaagactcc aagttgcttt ctgccttttg aaaactcctg aaaaccatcc 3120 

ctctggactc tggaattcta cacagctcaa ccaagacttt gcttgaatgt ttacattttc 3180 

tgctcgctgt cctacatatc acaatatagt gttcacgttt tgttaaaact ttggggtgtc 324 0 

aggagttgag cttgctcagc aagccagcat ggctaggatg agctttgtta tagcagcttg 3300 

ccaattggtg ctgggcctac taatgacttc attaaccgag tcttccatac agaatagtga 3360 

gtgtccacaa ctttgcgtat gtgaaattcg tccctggttt accccacagt caacttacag 3420 

agaagccacc actgttgatt gcaatgacct ccgcttaaca aggattccca gtaacctctc 34 80 

tagtgacaca caagtgcttc tcttacagag caataacatc gcgaagactg tggatgagct 3540 

gcagcagctt ttcaacttga ctgaactaga tttctcccaa aacaacttta ctaacattaa 3600 

ggaggtcggg ctggcaaacc taacccagct cacaacgctg catttggagg aaaatcagat 3660 

taccgagatg actgattact gtctacaaga cctcagcaac cttcaagaac tctacatcaa 3720 

ccacaaccaa attagcacta tttctgctca tgcttttgca ggcttaaaaa atctattaag 3780 

gctccacctg aactccaaca aattgaaagt tattgatagt cgctggtttg attctacacc 3840 

caacctggaa attctcatga tcggagaaaa ccctgtgatt ggaattctgg atatgaactt 3900 

caaacccctc gcaaatttga gaagcttagt tttggcagga atgtatctca ctgatattcc 3960 

tggaaatgct ttggtgggtc tggatagcct tgagagcctg tctttttatg ataacaaact 4 020 

ggttaaagtc cctcaacttg ccctgcaaaa agttccaaat ttgaaattct tagacctcaa 4 080 

caaaaacccc attcacaaaa tccaagaagg ggacttcaaa aatatgcttc ggttaaaaga 4140 

actgggaatc aacaatatgg gcgagctcgt ttctgtcgac cgctatgccc tggataactt 4 200 

gcctgaactc acaaagctgg aagccaccaa taaccctaaa ctctcttaca tccaccgctt 4260 

ggctttccga agtgtccctg ctctggaaag cttgatgctg aacaacaatg ccttgaatgc 4 320 

catttaccaa aagacagtcg aatccctccc caatctgcgt gagatcagta tccatagcaa 4 380 

tcccctcagg tgtgactgtg tgatccactg gattaactcc aacaaaacca acatccgctt 4 4 40 

catggagccc ctgtccatgt tctgtgccat gccgcccgaa tataaagggc accaggtgaa 4 500 

ggaagtttta atccaggatt cgagtgaaca gtgcctccca atgatatctc acgacagctt 4 560 

cccaaatcgt ttaaacgtgg atatcggcac gacggttttc ctagactgtc gagccatggc 4 620 

tgagccagaa cctgaaattt actgggtcac tcccattgga aataagataa ctgtggaaac 4 680 

cctttcagat aaatacaagc taagtagcga aggtaccttg gaaatatcta acatacaaat 4740 

tgaagactca ggaagataca catgtgttgc ccagaatgtc caaggggcag acactcgggt 4 800 

ggcaacaatt aaggttaacg ggacccttct ggatggtacc caggtgctaa aaatatacgt 4 860 

caagcagaca gaatcccatt ccatcttagt gtcctggaaa gttaattcca atgtcatgac 4 920 

gtcaaactta aaatggtcgt ctgccaccat gaagattgat aaccctcaca taacatatac 4 980 

tgccagggtc ccagtcgatg tccatgaata caacctaacg catctgcagc cttccacaga 5040 

ttatgaagtg tgtctcacag tgtccaatat tcatcagcag actcaaaagt catgcgtaaa 5100 

tgtcacaacc aaaaatgccg ccttcgcagt ggacatctct gatcaagaaa ccagtacagc 5160 

ccttgctgca gtaatggggt ctatgtttgc cgtcattagc cttgcgtcca ttgctgtgta 5220 

ctttgccaaa agatttaaga gaaaaaacta ccaccactca ttaaaaaagt atatgcaaaa 5280 

aacctcttca atcccactaa atgagctgta cccaccactc attaacctct gggaaggtga 534 0 

cagcgagaaa gacaaagatg gttctgcaga caccaagcca acccaggtcg acacatccag 5400 

aagctattac atgtggtaac tcagaggata ttttgcttct ggtagtaagg agcacaaaga 54 60 

cgtttttgct ttattctgca aaagtgaaca agttgaagac ttttgtattt ttgactttgc 5520 
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tagtttgtgg cagagtggag aggacgggtg gatatttcaa atttttttag tatagcgtat 5580 

cgcaagggtt tgacacggct gccagcgact ctaggcttcc agtctgtgtt tggtttttat 5640 

tcttatcatt attatgattg ttattatatt attattttat tttagttgtt gtgctaaact 5700 

caataatgct gttctaacta cagtgctcaa taaaatgatt aatgacagga tggggttccc 57 60 

ctgtgctttt accagtagca tgaccccttc tgaagccatc cgtagaaagt actttgtcct 5820 

ccaaaaagct aacatacggt tttgaagcag cattgaaact tttgtagcaa tctggtctat 5880 

agacttttaa ctcaagaagc taaggctaga cttgttacct tcgttgaatg atgttagttg 594 0 

actgtactgt aatgttgtat caactgaatt gaatgtttgc ctttaaacaa tgaattttct 6000 

ttttctttcc tttttttttt tttgttgtaa tagttaaaga ggcttagaac aagctaacag 6060 

gcaatagaaa tatgtatatc agatttttta atgtaacaaa ctacatgtta attgttatct 6120 

tattcttttt atctttagta gacactttta aaagaaaaga caattttgtt gtgtttaact 6180 

caccaacacg tggtgtataa tgaagacaga actataataa attagttttg ttctgatttt 6240 

ttagaacact tgcaataatg tatcatttat agttcttgct agttgcagtg gtaatatttt 6300 

tcacatccat aaaaacaact accaaaataa atcagctgta gcatgttgct ttttaaagct 6360 

aggccctaaa aggttttaat tctttttcta agggaagaaa tgtctatttt aattaagata 6420 

ttttaatgaa caggatttct gtattttaaa tagtactgac tagcacctaa tgggcagtgg 6480 

gagggtggtt catatgaaga aaaaaaaggt gtattgttgt atcccatgca taaataaagg 654 0 

taaatatata tatacaccaa tatattcata tatactcaca cacatcccaa cctgtcacac 6600 

acaatgcgtg tgtatatata tatgaatata ttggatatgt catttctgta agagttttgt 6660 

taaaacctga ttttcttttg tagtatccac attcttcatc aaagtacaaa aacgtctgtg 6720 

gagtgtcaca aactgtatga catgttattt ctttttaaca gttgtctata tgcttagacc 6780 

cgtgttagtc tctatatctg tgtggcaata tctgctgaga caagtaaatg attaatagaa 6840 

aac 68 4 3 



<210> 2 

<211> 6243 

<212> DNA 

<213> Homo sapiens 



<400> 2 

gttgcagtga gccgagatcg crccactgca ctccagcctg ggcgacrgag cgagactttg 60 

tytcaagatt actctaaaca gaaacaagag tgaaataaat ttgtggagtt aataataaaa 120 

atgttattct ctgtggtatg ttttcttttc ttttgttttt ttrrgacrgg gtcttgctct 180 

gttgcccagg ctggagtgca gtggcgccat cttggctcac tgcrrcctct gcctcccrgg 240 

ttcragcgat tctcctgcct crggcccctg agtrgctggg attacaggca cgtrccacca .300 

cgcctagcta atttttatat ttttagtaga gacrgggttt crccatgttg gtcatgttgg 360 

tctcgaactc ctgacctcat gatctgcccc ctgccrgcct cccacagtgc tgggattrca 4 20 

ggcgtgagcc accgcgcccg gcctgtattt tttcatttca tagaatttcc agataaaagt 480 

cagagtcaac tagttttata atactgaaaa gttttgcaaa taicaagcatg tcacttaatg 54 0 

atggggatgt gttctgagaa atgtgtcatt aggtgattcc gtcattgtgc aaatgtcata 600 

gagtatactt acataaacct aggtggtatg tagcctatat gatatatgtg gcttatatga 660 

tataccctat tgctcctagg ctgcaaacct gtatagcatg ttactatcct gaagtcaatg 720 

ggcaattgta acacaatggc aagtatttgt gtattgaaac ataagggtat agtaaaaata 780 

tggtactatt aatacaactg tatgggacta ccattgtata tgcggtccat cattggccaa 840 

actttcttta tggtggcaca agactacata ataaatggta aagtaaccag acttggagat 900 

ggggaggagg gagaagaatg ccaaatgtat gtttcctccc acagactagc aggcatctag 960 

tatcaggcaa cayacttata caatcatttc ctactcctgt ctcacacttg ccagaatctg 1020 

gatttaattt ttttttttta ttgttagacc agtaatgcag ttttcatatt ttgtataaca 1080 

aaaaagttgt aggacttttc cttcttactg tgaaaatatg actcattaaa tacataactt 1140 

ttagtattgg ccatttgtat tcttatactc ttagtcacaa agataactac gttttacaaa 1200 

gtcgaaaggt taataatagc agctagcatg aattcagagc ttgctatgta ttgagcactg 1260 

ggtttaaaca catatgctct attttattaa gtccttatag taggaaagtg aagaatacac 1320 

tagtcactga agttccttca gtgctcaaga ctcaagtgca ctatagcaag acaaacacca 1380 

ggaaacatgt aactctgctt cgaatacttt agcaagtgaa tggcccagtt tgggaaagca 14 40 

agacaaacac tggatccata atggcttttt acacctgatt ggaatgcaga aaaaatggaa 1500 

ccactaacta ctaatattat ctttgccagc aggtgagcag tacatacctg acttcagagg 1560 

attttattga aactcaatag atgatggtta ataaaaacaa atatcctcag ctttccttct 1620 

tttacttgtc atctgacaac cactatttag ctaaaactaa aatggattag agatacaaga 1680 
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tgatgaataa caattattaa cttttgtgag catataatgt gagtcagcta ctacactaag 17 4 0 

aactttacat agattccctt aattttctca ataatcctat gagaactgtc cttgtatgag 1800 

caaccccatt ttatagctga agatacttga ggcttcaaga ggttatataa acatgttcat 18 60 

ggtcaacagt tattaagcag tgagtgacat tctcacgaga ttaatgagat catctgtgtg 1920 

agatacaaat acatctattg tgtatgtttg tgtttttaat ggagaccaat ggtatgacag 1980 

aatgaagtcc ttttatcagt cagtcaagat ctgatgtggc tattcaaata taggcaaaac 204 0 

tgatgttttc cttttttagt taagttaaat gtactattta aataactgtg aatatttaga 2100 

atattcacag aattgtgcaa ccatcaccac aatctttaga atattttcat catctcaaaa 2160 

agatactttg tagcctttag cacccttccc actgtacccc cccccatacc cctggccata 2220 

ggcaactact aaattacttt ctgtccctgc aggttgtcct attctggata ctttgtatat 2280 

gtggaatcat aaaataggtg atcttttatg actggcatct ttcaagcatg tttttgatat 2340 

tttcatccat gttgtaggat attaaattaa ctttttgcat aggaataact gagctagaaa 2400 

agcatgggct gaatatgtct gcagaatgta gcatttagaa aagaaagact tccaggagac 24 60 

aaatgaggta gtaactaaaa tgtaaataac taaaaattag ctttgagtct ttaataaact 2520 

acctttcatg ttttgctttc actaggaatg aacttttaat tttttggagc tataaataca 2580 

tgtagtttct aaagacatct ttgtttttaa agaaaagaca tctgcaggct ttttcttagt 2640 

gttcctatta agtatgtatg agtgtgtgca tttgtgtgtg tgtgtgtgca tgtttagtga 2700 

ctttacatgt gtatacatac aaatgtacac agtcatatat gactagtggt acatggaatc 27 60 

tcaatttaat tgtaaaagtg aaggagttct aaaactagtt ttagtatagt aagtgggata 2820 

tagtaagaat gtcctacaat ttaaaagctt atttggttat cttcaatgga ttttgaatgt 2880 

attcaatact agatttgcga gtgactatgc ctttttagta aacaaaagaa aggctataat 2940 

gtagaaagta tatggaataa gcataacatc tgtcctaatc taattttcag gcacatctcc 3000 

agacttcaat ttggctgaaa taattcatgc cacggacctg tgcacatgcc tggaattgag 3060 

agacacagtt aaaagactcc aagttgcttt ctgccttttg aaaactcctg aaaaccatcc 3120 

ctctggactc tggaattcta cacagctcaa ccaagacttt gcttgaatgt ttacattttc 3180 

tgctcgctgt cctacatatc acaatatagt gttcacgttt tgttaaaact ttggggtgtc 3240 

aggagttgag cttgctcagc aagccagcat ggctaggatg agctttgtta tagcagcttg 3300 

ccaattggtg ctgggcctac taatgacttc attaaccgag tcttccatac agaatagtga 3360 

gtgtccacaa ctttgcgtat gtgaaattcg tccctggttt accccacagt caacttacag 3420 

agaagccacc actgttgatt gcaatgacct ccgcttaaca aggattccca gtaacctctc 3480 

tagtgacaca caagtgcttc tcttacagag caataacatc gcgaagactg tggatgagct 354 0 

gcagcagctt ttcaacttga ctgaactaga tttctcccaa aacaacttta ctaacattaa 3600 

ggaggtcggg ctggcaaacc taacccagct cacaacgctg catttggagg aaaatcagat 3660 

taccgagatg actgattact gtctacaaga cctcagcaac cttcaagaac tctacatcaa 3720 

ccacaaccaa attagcacta tttctgctca tgcttttgca ggcttaaaaa atctattaag 3780 

gctccacctg aactccaaca aattgaaagt tattgatagt cgctggtttg attctacacc 3840 

caacctggaa attctcatga tcggagaaaa ccctgtgatt ggaattctgg atatgaactt 3900 

caaacccctc gcaaatttga gaagcttagt tttggcagga atgtatctca ctgatattcc 3960 

tggaaatgct ttggtgggtc tggatagcct tgagagcctg tctttttatg ataacaaact 4 020 

ggttaaagtc cctcaacttg ccctgcaaaa agttccaaat ttgaaattct tagacctcaa 4 080 

caaaaacccc attcacaaaa tccaagaagg ggacttcaaa aatatgcttc ggttaaaaga ' 4140 

actgggaatc aacaatatgg gcgagctcgt ttctgtcgac cgctatgccc tggataactt 4200 

gcctgaactc acaaagctgg aagccaccaa taaccctaaa ctctcttaca tccaccgctt 4260 

ggctttccga agtgtccctg ctctggaaag cttgatgctg aacaacaatg ccttgaatgc 4 320 

catttaccaa aagacagtcg aatccctccc caatctgcgt gagatcagta tccatagcaa 4380 

tcccctcagg tgtgactgtg tgatccactg gattaactcc aacaaaacca acatccgctt 4 4 40 

catggagccc ctgtccatgt tctgtgccat gccgcccgaa tataaagggc accaggtgaa 4 500 

ggaagtttta atccaggatt cgagtgaaca gtgcctccca atgatatctc acgacagctt 4 560 

cccaaatcgt ttaaacgtgg atatcggcac gacggttttc ctagactgtc gagccatggc 4 620 

tgagccagaa cctgaaattt actgggtcac tcccattgga aataagataa ctgtggaaac 4 680 

cctttcagat aaatacaagc taagtagcga aggtaccttg gaaatatcta acatacaaat 4740 

tgaagactca ggaagataca catgtgttgc ccagaatgtc caaggggcag acactcgggt 4 800 

ggcaacaatt aaggttaacg ggacccttct ggatggtacc caggtgctaa aaatatacgt 4 860 

caagcagaca gaatcccatt ccatcttagt gtcctggaaa gttaattcca atgtcatgac 4 920 

gtcaaactta aaatggtcgt ctgccaccat gaagattgat aaccctcaca taacatatac 4 980 

tgccagggtc ccagtcgatg tccatgaata caacctaacg catctgcagc cttccacaga 5040 

ttatgaagtg tgtctcacag tgtccaatat tcatcagcag actcaaaagt catgcgtaaa 5100 

tgtcacaacc aaaaatgccg ccttcgcagt ggacatctct gatcaagaaa ccagtacagc 5160 

ccrcgctgca gtaatggggt ctatgtttgc cgtcattagc cttgcgtcca ttgctgtgta 5220 
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ctttgccaaa 
aacctcttca 
cagcgagaaa 
aagctattac 
cgtttttgct 
tagtttgtgg 
cgcaagggtt 
tcttatcatt 
caataatgct 
ctgtgctttt 
ccaaaaagct 
agacttttaa 
actgtactgt 
ttttctttcc 
gcaatagaaa 
tattcttttt 
caccaacacg 
tta 



agatttaaga 
atcccactaa 
gacaaagatg 
atgtggtaac 
ttattctgca 
cagagtggag 
tgacacggct 
attatgattg 
gttctaacta 
accagtagca 
aacatacggt 
ctcaagaagc 
aatgttgtat 
tttttttttt 
tatgtatatc 
atctttagta 
tggtgtataa 



gaaaaaacta 
atgagctgta 
gttctgcaga 
tcagaggata 
aaagtgaaca 
aggacgggtg 
gccagcgact 
ttattatatt 
cagtgctcaa 
tgaccccttc 
tttgaagcag 
taaggctaga 
caactgaatt 
tttgttgtaa 
agatttttta 
gacactttta 
tgaagacaga 



ccaccactca 
cccaccactc 
caccaagcca 
ttttgcttct 
agttgaagac 
gatatttcaa 
ctaggcttcc 
attattttat 
taaaatgatt 
tgaagccatc 
cattgaaact 
cttgttacct 
gaatgtttgc 
tagttaaaga 
atgtaacaaa 
aaagaaaaga 
actataataa 



ttaaaaaagt 
attaacctct 
acccaggtcg 
ggtagtaagg 
ttttgtattt 
atttttttag 
agtctgtgtt 
tttagttgtt 
aatgacagga 
cgtagaaagt 
tttgtagcaa 
tcgttgaatg 
ctttaaacaa 
ggcttagaac 
ctacatgtta 
caattttgtt 
attagttttg 



atatgcaaaa 
gggaaggtga 
acacatccag 
agcacaaaga 
ttgactttgc 
tatagcgtat 
tggtttttat 
gtgctaaact 
tggggttccc 
actttgtcct 
tctggtctat 
atgttagttg 
tgaattttct 
aagctaacag 
attgttatct 
gtgtttaact 
ttctgatttt 



5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6243 



<210> 3 

<211> 3912 

<212> DNA 

<213> Homo sapiens 

<400> 3 

cgccgccgtg ggtgccgggt ccgcgcgcac cccaacaccc ccaccagctg ggcctcgggg 60 

cacatctcca gacttcaatt tggctgaaat aattcatgcc acggacctgt gcacatgcct 120 

ggaattgaga gacacagtta aaagactcca agttgctttc tgccttttga aaactcctga 180 

aaaccatccc tctggactct ggaattctac acagctcaac caagactttg cttgaatgtt 24 0 

tacattttct gctcgctgtc ctacatatca caatatagtg ttcacgtttt gttaaaactt 300 

tggggtgtca ggagttgagc ttgctcagca agccagcatg gctaggatga gctttgttat 360 

agcagcttgc caattggtgc tgggcctact aatgacttca ttaaccgagt cttccataca 420 

gaatagtgag tgtccacaac tttgcgtatg tgaaattcgt ccctggttta ccccacagtc 480 

aacttacaga gaagccacca ctgttgattg caatgacctc cgcttaacaa ggattcccag 540 

taacctctct agtgacacac aagtgcttct cttacagagc aataacatcg cgaagactgt 600 

ggatgagctg cagcagcttt tcaacttgac tgaactagat ttctcccaaa acaactttac 660 

taacattaag gaggtcgggc tggcaaacct aacccagctc acaacgctgc atttggagga 720 

aaatcagatt accgagatga ctgattactg tctacaagac ctcagcaacc ttcaagaact 780 

ctacatcaac cacaaccaaa ttagcactat ttctgctcat gcttttgcag gcttaaaaaa 84 0 

tctattaagg ctccacctga actccaacaa attgaaagtt attgatagtc gctggtttga 900 

ttctacaccc aacctggaaa ttctcatgat cggagaaaac cctgtgattg gaattctgga 960 

tatgaacttc aaacccctcg caaatttgag aagcttagtt ttggcaggaa tgtatctcac 1020 

tgatattcct ggaaatgctt tggtgggtct ggatagcctt gagagcctgt ctttttatga 1080 

taacaaactg gttaaagtcc ctcaacttgc cctgcaaaaa gttccaaatt tgaaattctt 1140 

agacctcaac aaaaacccca ttcacaaaat ccaagaaggg gacttcaaaa atatgcttcg 1200 

gttaaaagaa ctgggaatca acaatatggg cgagctcgtt tctgtcgacc gctatgccct 1260 

ggataacttg cctgaactca caaagctgga agccaccaat aaccctaaac tctcttacat 1320 

ccaccgcttg gctttccgaa gtgtccctgc tctggaaagc ttgatgctga acaacaatgc 1380 

cttgaatgcc atttaccaaa agacagtcga atccctcccc aatctgcgtg agatcagtat 14 4 0 

ccatagcaat cccctcaggt gtgactgtgt gatccactgg attaactcca acaaaaccaa 1500 

catccgcttc atggagcccc tgtccatgtt ctgtgccatg ccgcccgaat ataaagggca 1560 

ccaggtgaag gaagttttaa tccaggattc gagtgaacag tgcctcccaa tgatatctca 1620 

cgacagcttc ccaaatcgtt taaacgtgga tatcggcacg acggttttcc tagactgtcg 1680 

agccatggct gagccagaac ctgaaattta ctgggtcact cccattggaa ataagataac 1740 

tgtggaaacc ctttcagata aatacaagct aagtagcgaa ggtaccttgg aaatatctaa 1800 

catacaaatt gaagactcag gaagatacac atgtgttgcc cagaatgtcc aaggggcaga 1860 

cactcgggtg gcaacaatta aggttaacgg gacccttctg gatggtaccc aggtgctaaa 1920 

aatatacgtc aagcagacag aatcccattc catcttagtg tcctggaaag ttaattccaa 1980 
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tgtcatgacg 
aacatatact 
ttccacagat 
atgcgtaaat 
cagtacagcc 
tgctgtgtac 
tatgcaaaaa 
ggaaggtgac 
cacatccaga 
gcacaaagac 
tgactttgct 
atagcgtatc 
ggtttttatt 
tgctaaactc 
ggggttcccc 
ctttgtcctc 
ctggtctata 
tgttagttga 
gaattttctt 
agctaacagg 
ttgttatctt 
tgtttaactc 
tctgattttt 
taatattttt 
tttaaagcta 
attaagatat 
gggcagtggg 
aaataaaggt 
ctgtcacaca 
gagttttgtt 
acgtctgtgg 
gcttagaccc 
ttaatagaaa 



tcaaacttaa 
gccagggtcc 
tatgaagtgt 
gtcacaacca 
cttgctgcag 
tttgccaaaa 
acctcttcaa 
agcgagaaag 
agctattaca 
gtttttgctt 
agtttgtggc 
gcaagggttt 
cttatcatta 
aataatgctg 
tgtgctttta 
caaaaagcta 
gacttttaac 
ctgtactgta 
tttctttcct 
caatagaaat 
attcttttta 
accaacacgt 
tagaacactt 
cacatccata 
ggccctaaaa 
tttaatgaac 
agggtggttc 
aaatatatat 
caatgcgtgt 
aaaacctgat 
agtgtcacaa 
gtgttagtct 
ac 



aatggtcgtc 
cagtcgatgt 
gtctcacagt 
aaaatgccgc 
taatggggtc 
gatttaagag 
tcccactaaa 
acaaagatgg 
tgtggtaact 
tattctgcaa 
agagtggaga 
gacacggctg 
ttatgattgt 
ttctaactac 
ccagtagcat 
acatacggtt 
tcaagaagct 
atgttgtatc 
tttttttttt 
atgtatatca 
tctttagtag 
ggtgtataat 
gcaataatgt 
aaaacaacta 
ggttttaatt 
aggatttctg 
atatgaagaa 
atacaccaat 
gtatatatat 
tttcttttgt 
actgtatgac 
ctatatctgt 



tgccaccatg 

ccatgaatac 

gtccaatatt 

cttcgcagtg 

tatgtttgcc 

aaaaaactac 

tgagctgtac 

ttctgcagac 

cagaggatat 

aagtgaacaa 

ggacgggtgg 

ccagcgactc 

tattatatta 

agtgctcaat 

gaccccttct 

ttgaagcagc 

aaggctagac 

aactgaattg 

ttgttgtaat 

gattttttaa 

acacttttaa 

gaagacagaa 

atcatttata 

ccaaaataaa 

ctttttctaa 

tattttaaat 

aaaaaaggtg 

atattcatat 

atgaatatat 

agtatccaca 

atgttatttc 

gtggcaatat 



aagattgata 

aacctaacgc 

catcagcaga 

gacatctctg 

gtcattagcc 

caccactcat 

ccaccactca 

accaagccaa 

tttgcttctg 

gttgaagact 

atatttcaaa 

taggcttcca 

ttattttatt 

aaaatgatta 

gaagccatcc 

attgaaactt 

ttgttacctt 

aatgtttgcc 

agttaaagag 

tgtaacaaac 

aagaaaagac 

ctataataaa 

gttcttgcta 

tcagctgtag 

gggaagaaat 

agtactgact 

tattgttgta 

atactcacac 



accctcacat 

atctgcagcc 

ctcaaaagtc 

atcaagaaac 

ttgcgtccat 

taaaaaagta 

ttaacctctg 

cccaggtcga 

gtagtaagga 

tttgtatttt 

tttttttagt 

gtctgtgttt 

ttagttgttg 

atgacaggat 

gtagaaagta 

ttgtagcaat 

cgttgaatga 

tttaaacaat 

gcttagaaca 

tacatgttaa 

aattttgttg 

ttagttttgt 

gttgcagtgg 

catgttgctt 

gtctatttta 

agcacctaat 

tcccatgcat 

acatcccaac 



tggatatgtc atttctgtaa 
ttcttcatca aagtacaaaa 
tttttaacag ttgtctatat 
ctgctgagac aagtaaatga 



2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3912 



<210> 4 

<211> 4173 

<212> DNA 

<213> Homo sapiens 



<400> 4 

gcggccgccg 

gggctcgggg 

cccgcgcgga 

ctcgcttgcc 

aacgactggc 

agtctaaatc 

cacggacctg 

ctgccttttg 

ccaagacttt 

gttcacgttt 

ggctaggatg 

attaaccgag 

tccctggttt 

ccgcttaaca 

caataacatc 

tttctcccaa 

cacaacgctg 

cctcagcaac 



actcggcgct 

aaaagtccag 

aacgcggacc 

gtgtccaccg 

cggggtacaa 

ccatcccgcc 

tgcacatgcc 

aaaactcctg 

gcttgaatgt 

tgttaaaact 

agctttgtta 

tcttccatac 

accccacagt 

aggattccca 

gcgaagactg 

aacaacttta 

catttggagg 

cttcaagaac 



tccccgccgc 
cagcgagagg 
aggcgcttcg 
ccgcggcccc 
atggttctga 
gcacatctcc 
tggaattgag 
aaaaccatcc 
ttacattttc 
ttggggtgtc 
tagcagcttg 
agaatagtga 
caacttacag 
gtaacctctc 
tggatgagct 
ctaacattaa 
aaaatcagat 
tctacatcaa 



ccgcccaggg 
gccgcgtgtc 
gcccggatgc 
ggcgccaaga 
agcacttgaa 
agacttcaat 
agacacagtt 
ctctggactc 
tgctcgctgt 
aggagttgag 
ccaattggtg 
gtgtccacaa 
agaagccacc 
tagtgacaca 
gcagcagctt 
ggaggtcggg 
taccgagatg 
ccacaaccaa 



agtccctgcc 
ccgggacggt 
cggatccagg 
acagagcctg 
gctcacaggc 
ttggctgaaa 
aaaagactcc 
tggaattcta 
cctacatatc 
cttgctcagc 
ctgggcctac 
ctttgcgtat 
actgttgatt 
caagtgcttc 
ttcaacttga 
ctggcaaacc 
actgattact 
attagcacta 



tggcaagctg 
tccaggggag 
agaaggggac 
gccgtgagtg 
aaaggctaag 
taattcatgc 
aagttgcttt 
cacagctcaa 
acaatatagt 
aagccagcat 
taatgacttc 
gtgaaattcg 
gcaatgacct 
tcttacagag 
ctgaactaga 
taacccagct 
gtctacaaga 
tttctgctca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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tgcttttgca ggcttaaaaa atctattaag gctccacctg aactccaaca aattgaaagt 114 0 

tattgatagt cgctggtttg attctacacc caacctggaa attctcatga tcggagaaaa 1200 

ccctgtgatt ggaattctgg atatgaactt caaacccctc gcaaatttga gaagcttagt 12 60 

tttggcagga atgtatctca ctgatattcc tggaaatgct ttggtgggtc tggatagcct 1320 

tgagagcctg tctttttatg ataacaaact ggttaaagtc cctcaacttg ccctgcaaaa 1380 

agttccaaat ttgaaattct tagacctcaa caaaaacccc attcacaaaa tccaagaagg 14 40 

ggacttcaaa aatatgcttc ggttaaaaga actgggaatc aacaatatgg gcgagctcgt 1500 

ttctgtcgac cgctatgccc tggataactt gcctgaactc acaaagctgg aagccaccaa 15 60 

taaccctaaa ctctcttaca tccaccgctt ggctttccga agtgtccctg ctctggaaag 1620 

cttgatgctg aacaacaatg ccttgaatgc catttaccaa aagacagtcg aatccctccc 1680 

caatctgcgt gagatcagta tccatagcaa tcccctcagg tgtgactgtg tgatccactg 1740 

gattaactcc aacaaaacca acatccgctt catggagccc ctgtccatgt tctgtgccat 18 00 

gccgcccgaa tataaagggc accaggtgaa ggaagtttta atccaggatt cgagtgaaca 18 60 

gtgcctccca atgatatctc acgacagctt cccaaatcgt ttaaacgtgg atatcggcac 1920 

gacggttttc ctagactgtc gagccatggc tgagccagaa cctgaaattt actgggtcac 1980 

tcccattgga aataagataa ctgtggaaac cctttcagat aaatacaagc taagtagcga 204 0 

aggtaccttg gaaatatcta acatacaaat tgaagactca ggaagataca catgtgttgc 2100 

ccagaatgtc caaggggcag acactcgggt ggcaacaatt aaggttaacg ggacccttct 2160 

ggatggtacc caggtgctaa aaatatacgt caagcagaca gaatcccatt ccatcttagt 2220 

gtcctggaaa gttaattcca atgtcatgac gtcaaactta aaatggtcgt ctgccaccat 2280 

gaagattgat aaccctcaca taacatatac tgccagggtc ccagtcgatg tccatgaata 2340 

caacctaacg catctgcagc cttccacaga ttatgaagtg tgtctcacag tgtccaatat 2400 

tcatcagcag actcaaaagt catgcgtaaa tgtcacaacc aaaaatgccg ccttcgcagt 24 60 

ggacatctct gatcaagaaa ccagtacagc ccttgctgca gtaatggggt ctatgtttgc 2520 

cgtcattagc cttgcgtcca ttgctgtgta ctttgccaaa agatttaaga gaaaaaacta 2580 

ccaccactca ttaaaaaagt atatgcaaaa aacctcttca atcccactaa atgagctgta 2640 

cccaccactc attaacctct gggaaggtga cagcgagaaa gacaaagatg gttctgcaga 2700 

caccaagcca acccaggtcg acacatccag aagctattac atgtggtaac tcagaggata 27 60 

ttttgcttct ggtagtaagg agcacaaaga cgtttttgct ttattctgca aaagtgaaca 2820 

agttgaagac ttttgtattt ttgactttgc tagtttgtgg cagagtggag aggacgggtg 2880 

gatatttcaa atttttttag tatagcgtat cgcaagggtt tgacacggct gccagcgact 2940 

ctaggcttcc agtctgtgtt tggtttttat tcttatcatt attatgattg ttattatatt 3000 

attattttat tttagttgtt gtgctaaact caataatgct gttctaacta cagtgctcaa 3060 

taaaatgatt aatgacagga tggggttccc ctgtgctttt accagtagca tgaccccttc 3120 

tgaagccatc cgtagaaagt actttgtcct ccaaaaagct aacatacggt tttgaagcag 3180 

cattgaaact tttgtagcaa tctggtctat agacttttaa ctcaagaagc taaggctaga 3240 

cttgttacct tcgttgaatg atgttagttg actgtactgt aatgttgtat caactgaatt 3300 

gaatgtttgc ctttaaacaa tgaattttct ttttctttcc tttttttttt tttgttgtaa 3360 

tagttaaaga ggcttagaac aagctaacag gcaatagaaa tatgtatatc agatttttta 3420 

atgtaacaaa ctacatgtta attgttatct tattcttttt atctttagta gacactttta 34 80 

aaagaaaaga caattttgtt gtgtttaact caccaacacg tggtgtataa tgaagacaga 354 0 

actataataa attagttttg ttctgatttt ttagaacact tgcaataatg tatcatttat 3600 

agttcttgct agttgcagtg gtaatatttt tcacatccat aaaaacaact accaaaataa 3660 

atcagctgta gcatgttgct ttttaaagct aggccctaaa aggttttaat tctttttcta 3720 

agggaagaaa tgtctatttt aattaagata ttttaatgaa caggatttct gtattttaaa 3780 

tagtactgac tagcacctaa tgggcagtgg gagggtggtt catatgaaga aaaaaaaggt 3840 

gtattgttgt atcccatgca taaataaagg taaatatata tatacaccaa tatattcata 3900 

tatactcaca cacatcccaa cctgtcacac acaatgcgtg tgtatatata tatgaatata 3960 

ttggatatgt catttctgta agagttttgt taaaacctga ttttcttttg tagtatccac 4020 

attcttcatc aaagtacaaa aacgtctgtg gagtgtcaca aactgtatga catgttattt 4 080 

ctttttaaca gttgtctata tgcttagacc cgtgttagtc tctatatctg tgtggcaata 4140 

tctgctgaga caagtaaatg attaatagaa aac 4173 



<210> 5 

<211> 812 

<212> PRT 

<213> Homo sapiens 

<400> 5 
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Met Phe Gin Ala His Leu Gin Thr Phe Asn Leu Ala Glu lie lie His 
15 10 15 

Ala Thr Asp Leu Cys Thr Cys Leu Glu Leu Arg Asp Thr Val Lys Arg 
20 ~ 25 30 

Leu Gin Val Ala Phe Cys Leu Leu Lys Thr Pro Glu Asn His Pro Ser 
35 - 40 45 

Gly Leu Trp Asn Ser Thr Gin Leu Asn Gin Asp Phe Ala Val Asn Val 
50 55 60 

Tyr lie Phe Cys Ser Leu Ser Tyr lie Ser Gin Tyr Ser Val His Val 
65 70 75 80 

Leu Leu Lys Leu Trp Gly Val Arg Ser Glu Leu Ala Gin Gin Ala Ser 
85 90 95 

Met Ala Arg Met Ser Phe Val lie Ala Ala Cys Gin Leu Val Leu Gly 
100 105 110 

Leu Leu Met Thr Ser Leu Thr Glu Ser Ser lie Gin Asn Ser Glu Cys 
115 120 125 

Pro Gin Leu Cys Val Cys Glu lie Arg Pro Trp Phe Thr Pro Gin Ser 
130 135 140 

Thr Tyr Arg Glu Ala Thr Thr Val Asp Cys Asn Asp Leu Arg Leu Thr 
145 150 155 160 

Arg lie Pro Ser Asn Leu Ser Ser Asp Thr Gin Val Leu Leu Leu Gin 
165 170 175 

Ser Asn Asn lie Ala Lys Thr Val Asp Glu Leu Gin Gin Leu Phe Asn 
180 " 185 190 

Leu Thr Glu Leu Asp Phe Ser Gin Asn Asn Phe Thr Asn lie Lys Glu 
195 200 205 

Val Gly Leu Ala Asn Leu Thr Gin Leu Thr Thr Leu His Leu Glu Glu 
210 215 220 

Asn Gin lie Thr Glu Met Thr Asp Tyr Cys Leu Gin Asp Leu Ser Asn 
225 230 235 240 

Leu Gin Glu Leu Tyr He Asn His Asn Gin He Ser Thr He Ser Ala 
245 250 255 

His Ala Phe Ala Gly Leu Lys Asn Leu Leu Arg Leu His Leu Asn Ser 
260 ^ 265 270 

Asn Lys Leu Lys Val He Asp Ser Arg Trp Phe Asp Ser Thr Pro Asn 
275 280 285 

Leu Glu He Leu Met He Gly Glu Asn Pro Val He Gly He Leu Asp 
290 295 300 

Met Asn Phe Lys Pro Leu Ala Asn Leu Arg Ser Leu Val Leu Ala Gly 



305 



310 



315 



320 
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Met Tyr Leu Thr Asp lie Pro Gly Asn Ala Leu Val Gly Leu Asp Ser 
325 330 335 

Leu Glu Ser Leu Ser Phe Tyr Asp Asn Lys Leu Val Lys Val Pro Gin 
340 345 350 

Leu Ala Leu Gin Lys Val Pro Asn Leu Lys Phe Leu Asp Leu Asn Lys 
355 360 365 

Asn Pro lie His Lys lie Gin Glu Gly Asp Phe Lys Asn Met Leu Arg 
370 375 380 

Leu Lys Glu Leu Gly lie Asn Ash Met Gly Glu Leu Val Ser Val Asp 
385 390 ~ 395 400 

Arg Tyr Ala Leu Asp Asn Leu Pro Glu Leu Thr Lys Leu Glu Ala Thr 
405 410 415 

Asn Asn Pro Lys Leu Ser Tyr lie His Arg Leu Ala Phe Arg Ser Val 
420 425 430 

Pro Ala Leu Glu Ser Leu Met Leu Asn Asn Asn Ala Leu Asn Ala lie 
435 440 445 

Tyr Gin Lys Thr Val Glu Ser Leu Pro Asn Leu Arg Glu lie Ser lie 
450 455 460 

His Ser Asn Pro Leu Arg Cys Asp Cys Val lie His Trp lie Asn Ser 
465 470 475 480 

Asn Lys Thr Asn lie Arg Phe Met Glu Pro Leu Ser Met Phe Cys Ala 
485 490 495 

Met Pro Pro Glu Tyr Lys Gly His Gin Val Lys Glu Val Leu lie Gin 
500 505 510 

Asp Ser Ser Glu Gin Cys Leu Pro Met lie Ser His Asp Ser Phe Pro 
515 520 525 

Asn Arg Leu Asn Val Asp lie Gly Thr Thr Val Phe Leu Asp Cys Arg 
530 535 540 

Ala Met Ala Glu Pro Glu Pro Glu lie Tyr Trp Val Thr Pro lie Gly 
545 550 555 560 

Asn Lys lie Thr Val Glu Thr Leu Ser Asp Lys Tyr Lys Leu Ser Ser 
565 570 575 

Glu Gly Thr Leu Glu lie Ser Asn lie Gin lie Glu Asp Ser Gly Arg 
580 585 590 

Tyr Thr Cys Val Ala Gin Asn Val Gin Gly Ala Asp Thr Arg Val Ala 
595 600 605 

Thr lie Lys Val Asn Gly Thr Leu Leu Asp Gly Thr Gin Val Leu Lys 
610 615 620 

lie Tyr Val Lys Gin Thr Glu Ser His Ser lie Leu Val Ser Trp Lys 



9 



WO 03/104438 



• 



PCT/US03/19741 



625 



630 



635 



640 



Val Asn Ser Asn Val Met Thr Ser Asn Leu Lys Trp Ser Ser Ala Thr 
645 650 655 

Met Lys He Asp Asn Pro His He Thr Tyr Thr Ala Arg Val Pro Val 
660 665 670 

Asp Val His Glu Tyr Asn Leu Thr His Leu Gin Pro Ser Thr Asp Tyr 
675 680 685 

Glu Val Cys Leu Thr Val Ser Asn He His Gin Gin Thr Gin Lys Ser 
690 695 700 

Cys Val Asn Val Thr Thr Lys Asn Ala Ala Phe Ala Val Asp He Ser 
705 710 715 720 

Asp Gin Glu Thr Ser Thr Ala Leu Ala Ala Val Met Gly Ser Met Phe 
725 730 735 

Ala Val He Ser Leu Ala Ser He Ala Val Tyr Phe Ala Lys Arg Phe 
740 745 750 

Lys Arg Lys Asn Tyr His His Ser Leu Lys Lys Tyr Met Gin Lys Thr 
755 760 765 

Ser Ser He Pro Leu Asn Glu Leu Tyr Pro Pro Leu He Asn Leu Trp 
770 775 780 

Glu Gly Asp Ser Glu Lys Asp Lys Asp Gly Ser Ala Asp Thr Lys Pro 
785 ~ ' 790 795 800 

Thr Gin Val Asp Thr Ser Arg Ser Tyr Tyr Met Trp 



<210> 6 

<211> 716 

<212> PRT 

<213> Homo sapiens 

<400> 6 

Met Ala Arg Met Ser Phe Val He Ala Ala Cys Gin Leu Val Leu Gly 
15 10 15 

Leu Leu Met Thr Ser Leu Thr Glu Ser Ser He Gin Asn Ser Glu Cys 
20 25 30 

Pro Gin Leu Cys Val Cys Glu He Arg Pro Trp Phe Thr Pro Gin Ser 
35 40 45 

Thr Tyr Arg Glu Ala Thr Thr Val Asp Cys Asn Asp Leu Arg Leu Thr 
50 55 60 

Arg He Pro Ser Asn Leu Ser Ser Asp Thr Gin Val Leu Leu Leu Gin 
65 70 75 80 

Ser Asn Asn He Ala Lys Thr Val Asp Glu Leu Gin Gin Leu Phe Asn 



805 



810 



85 



90 



95 
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Leu Thr Glu Leu Asp Phe Ser Gin Asn Asn Phe Thr Asn lie Lys Glu 
100 105 110 

Val Gly Leu Ala Asn Leu Thr Gin Leu Thr Thr Leu His Leu Glu Glu 
115 120 125 

Asn Gin lie Thr Glu Met Thr Asp Tyr Cys Leu Gin Asp Leu Ser Asn 
130 135 140 

Leu Gin Glu Leu Tyr lie Asn His Asn Gin lie Ser Thr lie Ser Ala 
145 " 150 155 160 

His Ala Phe Ala Gly Leu Lys Asn Leu Leu Arg Leu His Leu Asn Ser 
165 170 175 

Asn Lys Leu Lys Val lie Asp Ser Arg Trp Phe Asp Ser Thr Pro Asn 
180 185 190 

Leu Glu lie Leu Met lie Gly Glu Asn Pro Val lie Gly lie Leu Asp 
195 200 205 

Met Asn Phe Lys Pro Leu Ala Asn Leu Arg Ser Leu Val Leu Ala Gly 
210 "* 215 220 

Met Tyr Leu Thr Asp lie Pro Gly Asn Ala Leu Val Gly Leu Asp Ser 
225 230 235 240 

Leu Glu Ser Leu Ser Phe Tyr Asp Asn Lys Leu Val Lys Val Pro Gin 
245 250 255 

Leu Ala Leu Gin Lys Val Pro Asn Leu Lys Phe Leu Asp Leu Asn Lys 
260 ^ 265 270 

Asn Pro lie His Lys lie Gin Glu Gly Asp Phe Lys Asn Met Leu Arg 
275 * 280 285 

Leu Lys Glu Leu Gly lie Asn Asn Met Gly Glu Leu Val Ser Val Asp 
290 295 300 

Arg Tyr Ala Leu Asp Asn Leu Pro Glu Leu Thr Lys Leu Glu Ala Thr 
305 310 315 320 

Asn Asn Pro Lys Leu Ser Tyr lie His Arg Leu Ala Phe Arg Ser Val 
325 330 335 

Pro Ala Leu Glu Ser Leu Met Leu Asn Asn Asn Ala Leu Asn Ala lie 
340 345 350 

Tyr Gin Lys Thr Val Glu Ser Leu Pro Asn Leu Arg Glu lie Ser lie 
355 360 365 

His Ser Asn Pro Leu Arg Cys Asp Cys Val lie His Trp lie Asn Ser 
370 375 380 

Asn Lys Thr Asn lie Arg Phe Met Glu Pro Leu Ser Met Phe Cys Ala 
385 390 395 400 

Met Pro Pro Glu Tyr Lys Gly His Gin Val Lys Glu Val Leu lie Gin 
405 ~ 410 415 
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Asp Ser Ser Glu Gin Cys Leu Pro Met He Ser His Asp Ser Phe Pro 
420 425 430 

Asn Arg Leu Asn Val Asp He Gly Thr Thr Val Phe Leu Asp Cys Arg 
435 440 445 

Ala Met Ala Glu Pro Glu Pro Glu He Tyr Trp Val Thr Pro He Gly 
450 455 460 

Asn Lys He Thr Val Glu Thr Leu Ser Asp Lys Tyr Lys Leu Ser Ser 
465 470 475 480 

Glu Gly Thr Leu Glu He Ser Asn He Gin He Glu Asp Ser Gly Arg 
485 490 495 

Tyr Thr Cys Val Ala Gin Asn Val Gin Gly Ala Asp Thr Arg Val Ala 
500 505 510 

Thr He Lys Val Asn Gly Thr Leu Leu Asp Gly Thr Gin Val Leu Lys 
515 520 525 

He Tyr Val Lys Gin Thr Glu Ser His Ser He Leu Val Ser Trp Lys 
530 J 535 54 0 

Val Asn Ser Asn Val Met Thr Ser Asn Leu Lys Trp Ser Ser Ala Thr 
545 550 555 560 

Met Lys He Asp Asn Pro His He Thr Tyr Thr Ala Arg Val Pro Val 
565 570 575 

Asp Val His Glu Tyr Asn Leu Thr His Leu Gin Pro Ser Thr Asp Tyr 
580 585 590 

Glu Val Cys Leu Thr Val Ser Asn He His Gin Gin Thr Gin Lys Ser 
595 600 605 

Cys Val Asn Val Thr Thr Lys Asn Ala Ala Phe Ala Val Asp He Ser 
610 615 620 

Asp Gin Glu Thr Ser Thr Ala Leu Ala Ala Val Met Gly Ser Met Phe 
"625 630 635 640 

Ala Val He Ser Leu Ala Ser He Ala Val Tyr Phe Ala Lys Arg Phe 
645 650 655 

Lys Arg Lys Asn Tyr His His Ser Leu Lys Lys Tyr Met Gin Lys Thr 
660 665 670 

Ser Ser He Pro Leu Asn Glu Leu Tyr Pro Pro Leu He Asn Leu Trp 
675 680 685 

Glu Gly Asp Ser Glu Lys Asp Lys Asp Gly Ser Ala Asp Thr Lys Pro 
690 695 700 

Thr Gin Val Asp Thr Ser Arg Ser Tyr Tyr Met Trp 
705 710 715 
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